METHODS OF DIAGNOSIS OF BLADDER CANCER, COMPOSITIONS AND
METHODS OF SCREENING FOR MODULATORS OF BLADDER CANCER 

CROSS-REFERENCES TO RELATED APPLICATIONS 
This application is related to USSN 60/302,814, filed July 3, 2001; USSN 60/310,099,
filed August 3, 2001; USSN 60/343,705, filed November 8, 2001; USSN 60/350,666, filed
November 13, 2001; and USSN 60/372,246, filed April 12, 2001, each of which is
incorporated herein by reference.

FIELD OF THE INVENTION

The invention relates to the identification of nucleic acid and protein expression 
profiles and nucleic acids, products, and antibodies thereto that are involved in bladder 
cancer; and to the use of such expression profiles and compositions in the diagnosis, 
prognosis, and therapy of bladder cancer. The invention further relates to methods for 
identifying and using agents and/or targets that inhibit bladder cancer.

BACKGROUND OF THE INVENTION 
In the United States, over 50,000 new cases of bladder cancer are diagnosed annually,
and more than 10,000 deaths are attributed to bladder cancer each year. Bladder cancer is
now the fourth most common cancer among American men and the ninth most common cancer
among American women. It occurs three times more frequently in men than in women, and
roughly twice as frequently in white men as in black men.

Bladder cancer rarely occurs in people younger than 40 years of age, being primarily 
a disease of older men. Nonetheless, bladder cancer is a significant cause of illness and death 
in the United States. The risk of bladder cancer increases steeply with age, with over half of
all bladder cancer deaths occurring after age 70. In white men older than 65, the annual 
disease rate of bladder cancer is approximately 2 cases per 1,000 persons; this contrasts with 
a rate of 0.1 cases per 1,000 persons younger than 65. 

Within the United States, bladder cancer rates are higher among people who reside in
northern versus southern states, and are higher for people who live in urban versus rural areas.
Although these differences suggest that environmental as well as genetic factors may
contribute to the development and progression of the disease, other studies confirm that
certain genes play a role in bladder cancer. For example, expression of the tumor suppressor
gene p53 has been associated with an adverse prognosis for patients with invasive bladder
cancer. A retrospective study of 243 patients treated by radical cystectomy found that the
presence of nuclear p53 was an independent predictor for recurrence among patients with mid
to late stage tumors. Esrig, et al. (1994) N. Engl. J. Med. 331:1259-64.

Urinary bladder cancers represent a spectrum of diseases that can be grouped into
three general categories: superficial, invasive, and metastatic. The prognosis for treatment is
highly dependent on the stage at which the tumor is first diagnosed. A unique aspect of
bladder cancer treatment is that repeated surgical biopsy is an integral part of routine patient
management. This has permitted the conduct of molecular genetic studies of tumors from
specific stages of the disease. The results of these studies suggest that bladder cancers
develop and progress along at least two discrete pathways, which may account for differences
in invasiveness and metastatic potential. Incorporating molecular genetic factors into the
current paradigm for diagnosis and treatment will optimize the probability of cure and allow
the quality of life for bladder cancer patients to be maintained.

Early detection and treatment can prevent recurrence and progression of the disease
to an incurable stage. Thus, the identification of novel diagnostic markers and therapeutic
targets will improve the current treatment of bladder cancer patients. While industry and
academia have identified novel sequences, there has not been an equal effort exerted to
identify the function of these novel sequences in disease states. The elucidation of a role for
novel proteins and compounds in disease states for identification of diagnostic markers and
therapeutic targets is essential for improving the current treatment of bladder cancer patients.
Accordingly, provided herein are methods that can be used in diagnosis and prognosis of
bladder cancer. Additionally, provided herein are molecular targets for therapeutic
intervention in bladder cancer and other related bladder diseases. Further provided are
methods that can be used to screen candidate bioactive agents for the ability to modulate
bladder cancer.

SUMMARY OF THE INVENTION 
The present invention therefore provides nucleotide sequences of genes that are up-
and down-regulated in bladder cancer cells. Such genes are useful for diagnostic purposes,
and also as targets for screening for therapeutic compounds that modulate bladder cancer,
such as hormones or antibodies. Other aspects of the invention will become apparent to the
skilled artisan from the following description of the invention.

In one aspect, the present invention provides a method of detecting a bladder
cancer-associated transcript in a cell from a patient, the method comprising contacting a
biological sample from the patient with a polynucleotide that selectively hybridizes to a
sequence at least 80% identical to a sequence as shown in Tables 1A-13.

In one embodiment, the present invention provides a method of determining the level
of a bladder cancer-associated transcript in a cell from a patient.

In one embodiment, the present invention provides a method of detecting a bladder
cancer-associated transcript in a cell from a patient, the method comprising contacting a
biological sample from the patient with a polynucleotide that selectively hybridizes to a
sequence at least 80% identical to a sequence as shown in Tables 1A-13.

In one embodiment, the polynucleotide selectively hybridizes to a sequence at least
95% identical to a sequence as shown in Tables 1A-13.

In one embodiment, the biological sample is a tissue sample. In another embodiment, 
the biological sample comprises isolated nucleic acids, e.g., mRNA. 

In one embodiment, the polynucleotide is labeled, e.g., with a fluorescent label.

In one embodiment, the polynucleotide is immobilized on a solid surface.

In one embodiment, the patient is undergoing a therapeutic regimen to treat bladder 
cancer. In another embodiment, the patient is suspected of having metastatic bladder cancer. 
In one embodiment, the patient is a human. 

In one embodiment, the bladder cancer associated transcript is mRNA. 



WO 03/003906 



PCT/US02/21338 



In one embodiment, the method further comprises the step of amplifying nucleic acids 
before the step of contacting the biological sample with the polynucleotide. 

In another aspect, the present invention provides a method of monitoring the efficacy
of a therapeutic treatment of bladder cancer, the method comprising the steps of: (i) providing
a biological sample from a patient undergoing the therapeutic treatment; and (ii) determining
the level of a bladder cancer-associated transcript in the biological sample by contacting the
biological sample with a polynucleotide that selectively hybridizes to a sequence at least 80%
identical to a sequence as shown in Tables 1A-13, thereby monitoring the efficacy of the
therapy. In a further embodiment, the patient has metastatic bladder cancer. In a further
embodiment, the patient has a drug resistant form of bladder cancer.

In one embodiment, the method further comprises the step of: (iii) comparing the 
level of the bladder cancer-associated transcript to a level of the bladder cancer-associated 
transcript in a biological sample from the patient prior to, or earlier in, the therapeutic 
treatment. 

Additionally, provided herein is a method of evaluating the effect of a candidate
bladder cancer drug comprising administering the drug to a patient and removing a cell
sample from the patient. The expression profile of the cell is then determined. This method
may further comprise comparing the expression profile to an expression profile of a healthy
individual. In a preferred embodiment, said expression profile includes a gene of Tables
1A-13.

In one aspect, the present invention provides an isolated nucleic acid molecule
consisting of a polynucleotide sequence as shown in Tables 1A-13.

In one embodiment, an expression vector or cell comprises the isolated nucleic acid.

In one aspect, the present invention provides an isolated polypeptide which is encoded
by a nucleic acid molecule having a polynucleotide sequence as shown in Tables 1A-13.

In another aspect, the present invention provides an antibody that specifically binds to
an isolated polypeptide which is encoded by a nucleic acid molecule having a polynucleotide
sequence as shown in Tables 1A-13.

In one embodiment, the antibody is conjugated to an effector component, e.g., a
fluorescent label, a radioisotope or a cytotoxic chemical.

In one embodiment, the antibody is an antibody fragment. In another embodiment,
the antibody is humanized.

In one aspect, the present invention provides a method of detecting a bladder cancer 
cell in a biological sample from a patient, the method comprising contacting the biological 
sample with an antibody as described herein.

In another aspect, the present invention provides a method of detecting antibodies 
specific to bladder cancer in a patient, the method comprising contacting a biological sample 
from the patient with a polypeptide encoded by a nucleic acid comprising a sequence from 
Tables 1A-13. 

In another aspect, the present invention provides a method for identifying a compound
that modulates a bladder cancer-associated polypeptide, the method comprising the steps of:
(i) contacting the compound with a bladder cancer-associated polypeptide, the polypeptide
encoded by a polynucleotide that selectively hybridizes to a sequence at least 80% identical
to a sequence as shown in Tables 1A-13; and (ii) determining the functional effect of the
compound upon the polypeptide.

In one embodiment, the functional effect is a physical effect, an enzymatic effect, or a 
chemical effect. 

In one embodiment, the polypeptide is expressed in a eukaryotic host cell or cell
membrane. In another embodiment, the polypeptide is recombinant.

In one embodiment, the functional effect is determined by measuring ligand binding
to the polypeptide.

In another aspect, the present invention provides a method of inhibiting proliferation
of a bladder cancer-associated cell to treat bladder cancer in a patient, the method comprising
the step of administering to the patient a therapeutically effective amount of a compound
identified as described herein.

In one embodiment, the compound is an antibody. 

In another aspect, the present invention provides a drug screening assay comprising
the steps of: (i) administering a test compound to a mammal having bladder cancer or to a
cell sample isolated therefrom; and (ii) comparing the level of gene expression of a
polynucleotide that selectively hybridizes to a sequence at least 80% identical to a sequence
as shown in Tables 1A-13 in a treated cell or mammal with the level of gene expression of the
polynucleotide in a control cell sample or mammal, wherein a test compound that modulates
the level of expression of the polynucleotide is a candidate for the treatment of bladder
cancer.
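
The comparison in step (ii) can be illustrated with a minimal Python sketch. It assumes that expression levels are already available as positive numbers per transcript (for example, normalized hybridization intensities). The function names, the gene labels, and the two-fold cutoff (`min_abs_log2fc=1.0`) are illustrative assumptions and are not taken from this description, which requires only that the test compound modulate the level of expression.

```python
import math


def log2_fold_change(treated: float, control: float) -> float:
    """Log2 ratio of treated to control expression (both must be positive)."""
    if treated <= 0 or control <= 0:
        raise ValueError("expression levels must be positive")
    return math.log2(treated / control)


def modulated_transcripts(treated: dict, control: dict,
                          min_abs_log2fc: float = 1.0) -> dict:
    """Return {transcript: log2 fold change} for transcripts whose expression
    differs between treated and control by at least the cutoff.

    The cutoff is a hypothetical choice made for this sketch only.
    """
    hits = {}
    # Only transcripts measured in both samples can be compared.
    for name in treated.keys() & control.keys():
        lfc = log2_fold_change(treated[name], control[name])
        if abs(lfc) >= min_abs_log2fc:
            hits[name] = lfc
    return hits


# Hypothetical measurements for three transcripts.
treated_levels = {"geneA": 40.0, "geneB": 10.0, "geneC": 5.0}
control_levels = {"geneA": 10.0, "geneB": 11.0, "geneC": 20.0}
hits = modulated_transcripts(treated_levels, control_levels)
```

Under these assumed values, `geneA` (up four-fold) and `geneC` (down four-fold) would be flagged, while `geneB` would not. The same per-transcript comparison applies when a plurality of polynucleotides is compared against a control.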

In one embodiment, the control is a mammal with bladder cancer or a cell sample
therefrom that has not been treated with the test compound. In another embodiment, the
control is a normal cell or mammal.

In one embodiment, the test compound is administered in varying amounts or
concentrations. In another embodiment, the test compound is administered for varying time
periods. In another embodiment, the comparison can occur after addition or removal of the
drug candidate.

In one embodiment, the levels of a plurality of polynucleotides that selectively
hybridize to a sequence at least 80% identical to a sequence as shown in Tables 1A-13 are
individually compared to their respective levels in a control cell sample or mammal. In a
preferred embodiment, the plurality of polynucleotides is from three to ten.

In another aspect, the present invention provides a method for treating a mammal 
having bladder cancer comprising administering a compound identified by the assay 
described herein. 

In another aspect, the present invention provides a pharmaceutical composition for
treating a mammal having bladder cancer, the composition comprising a compound identified
by the assay described herein and a physiologically acceptable excipient.

In one aspect, the present invention provides a method of screening drug candidates
by providing a cell expressing a gene that is up- or down-regulated as in a bladder cancer.
In one embodiment, a gene is selected from Tables 1A-13. The method further includes
adding a drug candidate to the cell and determining the effect of the drug candidate on the
expression of the expression profile gene.

In one embodiment, the method of screening drug candidates includes comparing the
level of expression in the absence of the drug candidate to the level of expression in the
presence of the drug candidate, wherein the concentration of the drug candidate can vary
when present, and wherein the comparison can occur after addition or removal of the drug
candidate. In a preferred embodiment, the cell expresses at least two expression profile
genes. The profile genes may show an increase or decrease.

Also provided is a method of evaluating the effect of a candidate bladder cancer drug
comprising administering the drug to a transgenic animal expressing or over-expressing the
bladder cancer modulatory protein, or an animal lacking the bladder cancer modulatory
protein, for example as a result of a gene knockout.

Moreover, provided herein is a biochip comprising one or more nucleic acid segments
of Tables 1A-13, wherein the biochip comprises fewer than 1000 nucleic acid probes.
Preferably, at least two nucleic acid segments are included. More preferably, at least three
nucleic acid segments are included.

Furthermore, a method of diagnosing a disorder associated with bladder cancer is
provided. The method comprises determining the expression of a gene of Tables 1A-13 in a
first tissue type of a first individual, and comparing the distribution to the expression of the
gene from a second normal tissue type from the first individual or a second unaffected
individual. A difference in the expression indicates that the first individual has a disorder
associated with bladder cancer.

In a further embodiment, the biochip also includes a polynucleotide sequence of a 
gene that is not up- and down-regulated in bladder cancer. 

In one embodiment, a method is provided for screening for a bioactive agent capable of
interfering with the binding of a bladder cancer modulating protein (bladder cancer modulatory
protein) or a fragment thereof and an antibody which binds to said bladder cancer modulatory
protein or fragment thereof. In a preferred embodiment, the method comprises combining a
bladder cancer modulatory protein or fragment thereof, a candidate bioactive agent and an
antibody which binds to said bladder cancer modulatory protein or fragment thereof. The method
further includes determining the binding of said bladder cancer modulatory protein or
fragment thereof and said antibody. Where there is a change in binding, an agent is
identified as an interfering agent. The interfering agent can be an agonist or an antagonist.
Preferably, the agent inhibits bladder cancer.

Also provided herein are methods of eliciting an immune response in an individual.
In one embodiment, a method provided herein comprises administering to an individual a
composition comprising a bladder cancer modulating protein, or a fragment thereof. In
another embodiment, the protein is encoded by a nucleic acid selected from those of Tables
1A-13.

Further provided herein are compositions capable of eliciting an immune response in
an individual. In one embodiment, a composition provided herein comprises a bladder cancer
modulating protein, preferably encoded by a nucleic acid of Tables 1A-13 or a fragment
thereof, and a pharmaceutically acceptable carrier. In another embodiment, said composition
comprises a nucleic acid comprising a sequence encoding a bladder cancer modulating
protein, preferably selected from the nucleic acids of Tables 1A-13, and a pharmaceutically
acceptable carrier.

Also provided are methods of neutralizing the effect of a bladder cancer protein, or a 
fragment thereof, comprising contacting an agent specific for said protein with said protein in 
an amount sufficient to effect neutralization. In another embodiment, the protein is encoded 
by a nucleic acid selected from those of Tables 1A-13. 

In another aspect of the invention, a method of treating an individual for bladder
cancer is provided. In one embodiment, the method comprises administering to said
individual an inhibitor of a bladder cancer modulating protein. In another embodiment, the
method comprises administering to a patient having bladder cancer an antibody to a bladder
cancer modulating protein conjugated to a therapeutic moiety. Such a therapeutic moiety can
be a cytotoxic agent or a radioisotope.

DETAILED DESCRIPTION OF THE INVENTION 
In accordance with the objects outlined above, the present invention provides novel
methods for diagnosis and prognosis evaluation for bladder disease (BD), e.g., cancer,
including metastatic bladder cancer, as well as methods for screening for compositions which
modulate bladder diseases. Also provided are methods and compositions for treating bladder
disease. Related conditions where these markers may also be useful include, e.g.,
carcinoma in situ; various stages of papillary carcinomas; and such conditions in different
stages, layers, structural portions, etc.

Recent advances in molecular medicine, generally, have increased the interest in
tumor-specific cell surface antigens that could serve as diagnostic or prognostic markers, or
as targets for various immunotherapeutic or small molecule strategies. Antigens suitable for
immunotherapeutic strategies should be highly expressed in cancer tissues and ideally not
expressed in other, e.g., normal, adult tissues. Expression in tissues that are dispensable for
life, however, may be tolerated, as a physiological consequence of such expression would be
limited. Examples of such antigens in cancers other than bladder cancer include Her2/neu
and the B-cell antigen CD20. Humanized monoclonal antibodies directed to Her2/neu
(Herceptin®/trastuzumab) are currently in use for the treatment of metastatic breast cancer.
Ross and Fletcher (1998) Stem Cells 16:413-428. Similarly, anti-CD20 monoclonal
antibodies (Rituxan®/rituximab) are used to effectively treat non-Hodgkin's lymphoma.
Maloney, et al. (1997) Blood 90:2188-2195; and Leget and Czuczman (1998) Curr. Opin.
Oncol. 10:548-551.

Definitions

The term "bladder cancer protein" or "bladder cancer polynucleotide" or "bladder
cancer-associated transcript" refers to nucleic acid and polypeptide polymorphic variants,
alleles, mutants, and interspecies homologues that: (1) have a nucleotide sequence that has
greater than about 60% nucleotide sequence identity, 65%, 70%, 75%, 80%, 85%, 90%,
preferably 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% or greater nucleotide
sequence identity, preferably over a region of at least about 25, 50, 100, 200,
500, 1000, or more nucleotides, to a nucleotide sequence of or associated with a gene of
Tables 1A-13; (2) bind to antibodies, e.g., polyclonal antibodies, raised against an
immunogen comprising an amino acid sequence encoded by a nucleotide sequence of or
associated with a gene of Tables 1A-13, and conservatively modified variants thereof; (3)
specifically hybridize under stringent hybridization conditions to a nucleic acid sequence, or
the complement thereof, of Tables 1A-13 and conservatively modified variants thereof; or (4)
have an amino acid sequence that has greater than about 60% amino acid sequence identity,
65%, 70%, 75%, 80%, 85%, 90%, preferably 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or
99% or greater amino acid sequence identity, preferably over a region of at least
about 25, 50, 100, 200, 500, 1000, or more amino acids, to an amino acid sequence encoded
by a nucleotide sequence of or associated with a gene of Tables 1A-13. A polynucleotide or
polypeptide sequence is typically from a mammal including, but not limited to, primate, e.g.,
human; rodent, e.g., rat, mouse, hamster; cow, pig, horse, sheep, or other mammal. A
"bladder cancer polypeptide" and a "bladder cancer polynucleotide" include both naturally
occurring and recombinant forms.

A "full length" bladder cancer protein or nucleic acid refers to a bladder cancer
polypeptide or polynucleotide sequence, or a variant thereof, that contains all of the elements
normally contained in one or more naturally occurring, wild type bladder cancer
polynucleotide or polypeptide sequences. The "full length" may be prior to, or after, various
stages of splicing, including alternative splicing, or post-translation processing.

"Biological sample" as used herein is a sample of biological tissue or fluid, e.g., that 
contains nucleic acids or polypeptides of a bladder cancer protein, polynucleotide, or 
transcript. Such samples include, but are not limited to, tissue isolated from primates, e.g.,
humans, or rodents, e.g., mice and rats. Biological samples may also include sections of
tissues such as biopsy and autopsy samples, frozen sections taken for histologic purposes,
blood, plasma, serum, sputum, stool, urine, tears, mucus, hair, skin, etc. Biological samples
also include explants and primary and/or transformed cell cultures derived from patient
tissues. A biological sample is typically obtained from a eukaryotic organism, most
preferably a mammal such as a primate, e.g., chimpanzee or human; cow; dog; cat; a rodent,
e.g., guinea pig, rat, or mouse; rabbit; or a bird; reptile; or fish.

"Providing a biological sample" means to obtain a biological sample for use in 
methods described in this invention. Most often, this will be done by removing a sample of 
cells from an animal, but can also be accomplished by using previously isolated cells (e.g.,
isolated by another person, at another time, and/or for another purpose), or by performing the
methods of the invention in vivo. Archival tissues, having treatment or outcome history, will
be particularly useful.

The terms "identical" or percent "identity," in the context of two or more nucleic acids
or polypeptide sequences, refer to two or more sequences or subsequences that are the same
or have a specified percentage of amino acid residues or nucleotides that are the same (e.g.,
about 60% identity, preferably 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%,
96%, 97%, 98%, 99%, or higher identity over a specified region, when compared and aligned
for maximum correspondence over a comparison window or designated region) as measured
using the BLAST or BLAST 2.0 sequence comparison algorithms with default parameters
described below, or by manual alignment and visual inspection (see, e.g., NCBI web site
http://www.ncbi.nlm.nih.gov/BLAST/ or the like). Such sequences are then said to be
"substantially identical." This definition also refers to, or may be applied to, the complement
of a test sequence. The definition also includes sequences that have deletions and/or
additions, substitutions, naturally occurring variants, e.g., polymorphic or allelic, and
man-made variants. As described below, the preferred algorithms can account for gaps and the
like. Preferably, identity exists over a region that is at least about 25 amino acids or
nucleotides in length, or more preferably over a region that is 50-100 amino acids or
nucleotides in length.
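
As a rough illustration of the percent identity calculation described above, the following Python sketch counts identical positions in two sequences that have already been aligned (for example, by BLAST or by manual alignment). It is a minimal sketch under stated assumptions, not the BLAST implementation: the function names, the use of "-" as the gap character, and the choice to count a gap opposite a residue as a mismatch are all assumptions made here for illustration.

```python
def percent_identity(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Percentage of aligned columns holding the same residue.

    Assumptions of this sketch: both strings come from one pairwise
    alignment (equal length), columns that are gaps in both sequences
    are ignored, and a gap opposite a residue counts as a mismatch.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    columns = [
        (a, b)
        for a, b in zip(aligned_a.upper(), aligned_b.upper())
        if not (a == gap and b == gap)
    ]
    if not columns:
        return 0.0
    matches = sum(1 for a, b in columns if a == b)
    return 100.0 * matches / len(columns)


def meets_identity(aligned_a: str, aligned_b: str,
                   threshold: float = 80.0, min_region: int = 25) -> bool:
    """True if identity reaches the threshold over a long enough region.

    The defaults mirror figures used in the text (80% identity, a region
    of at least about 25 residues); they are parameters, not fixed rules.
    """
    return (len(aligned_a) >= min_region
            and percent_identity(aligned_a, aligned_b) >= threshold)
```

For instance, two aligned 30-residue sequences that agree at 24 positions have 80% identity and would satisfy `meets_identity` with the defaults shown, whereas a perfect 4-residue match would not, because the region is shorter than 25 residues.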

For sequence comparison, typically one sequence acts as a reference sequence, to
which test sequences are compared. When using a sequence comparison computer algorithm,
test and reference sequences, subsequence coordinates, and sequence algorithm program
parameters are typically designated. Default or alternative program parameters can be
selected. The sequence comparison algorithm then calculates the percent sequence identities
for the test sequences relative to the reference sequence, based on the program parameters.

A "comparison window", as used herein, includes reference to a segment of any one of the
number of contiguous positions selected from the group consisting typically of from about
20-600, usually about 50-200, more usually about 100-150, in which a sequence may be
compared to a reference sequence of the same number of contiguous positions after the two
sequences are optimally aligned. Methods of alignment of sequences for comparison are
well-known in the art. Optimal alignment of sequences for comparison can be conducted by,
e.g., the local homology algorithm of Smith and Waterman (1981) Adv. Appl. Math. 2:482,
the homology alignment algorithm of Needleman and Wunsch (1970) J. Mol. Biol. 48:443-453,
the search for similarity method of Pearson and Lipman (1988) Proc. Nat'l Acad. Sci.
USA 85:2444-2448, computerized implementations of these algorithms (GAP, BESTFIT,
FASTA, and TFASTA in the Wisconsin Genetics Software Package, Genetics Computer
Group, 575 Science Dr., Madison, WI), or manual alignment and visual inspection (see, e.g.,
Ausubel, et al. (eds. 1995 and supplements) Current Protocols in Molecular Biology,
Lippincott).

Preferred algorithms suitable for determining percent sequence identity and sequence
similarity include the BLAST and BLAST 2.0 algorithms. See Altschul, et al. (1997) Nuc.
Acids Res. 25:3389-3402; and Altschul, et al. (1990) J. Mol. Biol. 215:403-410. BLAST and
BLAST 2.0 are used, with the parameters described herein, to determine percent sequence
identity for the nucleic acids and proteins of the invention. Software for performing BLAST
analyses is publicly available through the National Center for Biotechnology Information
(http://www.ncbi.nlm.nih.gov/). This algorithm involves first identifying high scoring
sequence pairs (HSPs) by identifying short words of length W in the query sequence, which
either match or satisfy some positive-valued threshold score T when aligned with a word of
the same length in a database sequence. T is referred to as the neighborhood word score
threshold (Altschul, et al., supra). These initial neighborhood word hits act as seeds for
initiating searches to find longer HSPs containing them. The word hits are extended in both
directions along each sequence for as far as the cumulative alignment score can be increased.
Cumulative scores are calculated using, e.g., for nucleotide sequences, the parameters M
(reward score for a pair of matching residues; always > 0) and N (penalty score for
mismatching residues; always < 0). For amino acid sequences, a scoring matrix is used to
calculate the cumulative score. Extension of the word hits in each direction is halted when:
the cumulative alignment score falls off by the quantity X from its maximum achieved value;
the cumulative score goes to zero or below, due to the accumulation of one or more
negative-scoring residue alignments; or the end of either sequence is reached. The BLAST
algorithm parameters W, T, and X determine the sensitivity and speed of the alignment. The
BLASTN program (for nucleotide sequences) uses as defaults a wordlength (W) of 11, an
expectation (E) of 10, M=5, N=-4, and a comparison of both strands. For amino acid sequences,
the BLASTP program uses as defaults a wordlength of 3, expectation (E) of 10, and the
BLOSUM62 scoring matrix (see Henikoff and Henikoff (1992) Proc. Nat'l Acad. Sci. USA
89:10915-10919), alignments (B) of 50, expectation (E) of 10, M=5, N=-4, and a comparison of
both strands.
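
The extension step described above can be sketched in Python as follows. This is a simplified illustration and not the BLAST source code: it extends a word hit in one direction only, without gaps, and scores nucleotides with the reward M and penalty N given in the text (M=5, N=-4). The function name `extend_right` and the default drop-off value `x_drop=20` are assumptions of this sketch; the text does not specify a value for X.

```python
def extend_right(query: str, subject: str, q_start: int, s_start: int,
                 word_len: int, match: int = 5, mismatch: int = -4,
                 x_drop: int = 20) -> tuple:
    """Extend a word hit to the right without gaps.

    Returns (best_score, best_length), where best_length includes the seed.
    Extension halts when the score falls x_drop or more below its maximum,
    when the score reaches zero or below, or when either sequence ends.
    """
    # Score the seed word itself.
    score = 0
    for k in range(word_len):
        same = query[q_start + k] == subject[s_start + k]
        score += match if same else mismatch
    best_score, best_len = score, word_len

    i, j = q_start + word_len, s_start + word_len
    length = word_len
    while i < len(query) and j < len(subject):
        score += match if query[i] == subject[j] else mismatch
        length += 1
        if score > best_score:
            best_score, best_len = score, length
        # Halting conditions taken from the description above.
        if best_score - score >= x_drop or score <= 0:
            break
        i += 1
        j += 1
    return best_score, best_len


# Seed "ACGT" at position 0; the sequences diverge after 8 residues.
result = extend_right("ACGTACGTAAAA", "ACGTACGTCCCC", 0, 0, 4, x_drop=10)
```

In this example the score rises to 40 over the first eight matching residues, then drops by 4 per mismatch until the fall-off reaches the chosen X of 10, so the reported segment is the eight-residue match. Extension to the left would follow the same logic with the indices decreasing.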

The BLAST algorithm also performs a statistical analysis of the similarity between
two sequences (see, e.g., Karlin and Altschul (1993) Proc. Nat'l Acad. Sci. USA
90:5873-5877). One measure of similarity provided by the BLAST algorithm is the smallest sum
probability (P(N)), which provides an indication of the probability by which a match between
two nucleotide or amino acid sequences would occur by chance. For example, a nucleic acid
is considered similar to a reference sequence if the smallest sum probability in a comparison
of the test nucleic acid to the reference nucleic acid is less than about 0.2, more preferably
less than about 0.01, and most preferably less than about 0.001. Log values may be large
negative numbers, e.g., 5, 10, 20, 30, 40, 40, 70, 90, 110, 150, 170, etc.
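
The thresholds above can be expressed as a small Python helper. This is only a sketch of the stated cutoffs (0.2, 0.01, and 0.001); the function name and the tier labels are hypothetical and are introduced here for illustration.

```python
def similarity_tier(p_n: float) -> str:
    """Classify a smallest sum probability P(N) by the cutoffs in the text.

    Tier labels are illustrative names, not terms defined in this document.
    """
    if not 0.0 <= p_n <= 1.0:
        raise ValueError("a probability must lie between 0 and 1")
    if p_n < 0.001:
        return "most preferred"
    if p_n < 0.01:
        return "more preferred"
    if p_n < 0.2:
        return "similar"
    return "not similar"
```

A comparison with P(N) of 0.05 would fall in the first tier ("similar"), while one with P(N) of 0.0005 would satisfy the most stringent cutoff.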

An indication that two nucleic acid sequences or polypeptides are substantially
identical is that the polypeptide encoded by the first nucleic acid is immunologically cross
reactive with the antibodies raised against the polypeptide encoded by the second nucleic
acid, as described below. Thus, a polypeptide is typically substantially identical to a second
polypeptide, e.g., where the two peptides differ only by conservative substitutions. Another
indication that two nucleic acid sequences are substantially identical is that the two molecules
or their complements hybridize to each other under stringent conditions, as described below.
Yet another indication that two nucleic acid sequences are substantially identical is that the
same primers can be used to amplify the sequences.

A "host cell" is a naturally occurring cell or a transformed cell that contains an
expression vector and supports the replication or expression of the expression vector. Host
cells may be cultured cells, explants, cells in vivo, and the like. Host cells may be
prokaryotic cells such as E. coli, or eukaryotic cells such as yeast, insect, amphibian, or
mammalian cells such as CHO, HeLa, and the like (see, e.g., the American Type Culture
Collection catalog or web site, www.atcc.org).

The terms "isolated," "purified," or "biologically pure" refer to material that is
substantially or essentially free from components that normally accompany it as found in its
native state. Purity and homogeneity are typically determined using analytical chemistry
techniques such as polyacrylamide gel electrophoresis or high performance liquid
chromatography. A protein or nucleic acid that is the predominant species present in a
preparation is substantially purified. In particular, an isolated nucleic acid is separated from
some open reading frames that naturally flank the gene and encode proteins other than the
protein encoded by the gene. The term "purified" in some embodiments typically denotes that a
nucleic acid or protein gives rise to essentially one band in an electrophoretic gel. Preferably,
it means that the nucleic acid or protein is at least 85% pure, more preferably at least 95%
pure, and most preferably at least 99% pure. "Purify" or "purification" in other embodiments
means removing at least one contaminant from the composition to be purified. In this sense,
purification does not require that the purified compound be homogeneous, e.g., 100% pure.

The terms "polypeptide," "peptide," and "protein" are used interchangeably herein to 
refer to a polymer of amino acid residues. The terms apply to amino acid polymers in which
at least one amino acid residue is an artificial chemical mimetic of a corresponding naturally
occurring amino acid, as well as to naturally occurring amino acid polymers, those containing
modified residues, and non-naturally occurring amino acid polymers.

The term "amino acid" embraces naturally occurring or synthetic amino acids, amino 
acid analogs, and amino acid mimetics. Naturally occurring amino acids are those encoded
by the genetic code, as well as those amino acids that are later modified, e.g., hydroxyproline,
γ-carboxyglutamate, and O-phosphoserine. Amino acid analogs include compounds that
share a basic chemical structure with a naturally occurring amino acid, e.g., an α carbon that
is bound to a hydrogen, a carboxyl group, an amino group, and an R group, e.g., homoserine,
norleucine, methionine sulfoxide, methionine methyl sulfonium. Such analogs may have
modified R groups (e.g., norleucine) or modified peptide backbones, but share some basic
chemical structure with a naturally occurring amino acid. Amino acid mimetics include 
chemical compounds that have a structure that is different from the general chemical 
structure of an amino acid, but that function similarly to a naturally occurring amino acid. 

Amino acids may be referred to herein by their commonly known three letter symbols
or by the one-letter symbols recommended by the IUPAC-IUB Biochemical Nomenclature
Commission. Nucleotides, likewise, may be referred to by their commonly accepted
single-letter codes.

"Conservatively modified variants" applies to amino acid or nucleic acid sequences. 
With respect to particular nucleic acid sequences, conservatively modified variants refer to
those nucleic acids which encode identical or essentially identical amino acid sequences, or
where the nucleic acid does not encode an amino acid sequence, to essentially identical or
associated, e.g., naturally contiguous, sequences. Because of the degeneracy of the genetic
code, a large number of functionally identical nucleic acids encode most proteins. For
instance, the codons GCA, GCC, GCG, and GCU all encode the amino acid alanine. Thus, at
each position where an alanine is specified by a codon, the codon can be altered to another of
the corresponding codons described without altering the encoded polypeptide. Such nucleic 
acid variations are "silent variations," which are one species of conservatively modified 
variations. Each nucleic acid sequence herein which encodes a polypeptide also describes 
silent variations of the nucleic acid. One of skill will recognize that in certain contexts each
codon in a nucleic acid (except AUG, which is ordinarily the only codon for methionine, and
UGG, which is ordinarily the only codon for tryptophan) can be modified to yield a
functionally identical molecule. Accordingly, silent variations of a nucleic acid which
encodes a polypeptide are often implicit in a described sequence with respect to the expression
product, but not with respect to actual probe sequences. 
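
By way of illustration only, the notion of a silent variation can be sketched in Python. The sketch is not part of the disclosed methods: the table CODON_TABLE covers only the codons named above, and the names translate and is_silent_variation are hypothetical. Codons are written in RNA notation, so the single tryptophan codon appears as UGG.

```python
# Partial codon table (RNA notation); only the codons discussed in the text.
CODON_TABLE = {
    "GCA": "Ala", "GCC": "Ala", "GCG": "Ala", "GCU": "Ala",
    "AUG": "Met",  # ordinarily the only codon for methionine
    "UGG": "Trp",  # ordinarily the only codon for tryptophan
}


def translate(rna):
    """Translate an RNA coding sequence codon by codon using CODON_TABLE."""
    if len(rna) % 3 != 0:
        raise ValueError("sequence length must be a multiple of 3")
    return [CODON_TABLE[rna[i:i + 3]] for i in range(0, len(rna), 3)]


def is_silent_variation(rna_a, rna_b):
    """True if the sequences differ at the nucleotide level but encode
    the same amino acid sequence."""
    return (
        len(rna_a) == len(rna_b)
        and rna_a != rna_b
        and translate(rna_a) == translate(rna_b)
    )


original = "AUGGCAUGG"  # Met-Ala-Trp
variant = "AUGGCCUGG"   # alanine codon GCA replaced by GCC
print(is_silent_variation(original, variant))
```

Because only the alanine codon has synonyms in this reduced table, the alanine position is the only one at which a silent variation can be introduced here.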

As to amino acid sequences, one of skill will recognize that individual substitutions,
deletions, or additions to a nucleic acid, peptide, polypeptide, or protein sequence which
alter, add, or delete a single amino acid or a small percentage of amino acids in the
encoded sequence yield a "conservatively modified variant" where the alteration results in the
substitution of an amino acid with a chemically similar amino acid. Conservative substitution
tables providing functionally similar amino acids are well known in the art. Such
conservatively modified variants are in addition to and do not exclude polymorphic variants,
interspecies homologs, and alleles of the invention. Typically conservative substitutions for
one another include: 1) Alanine (A), Glycine (G); 2) Aspartic acid (D), Glutamic acid (E); 3)
Asparagine (N), Glutamine (Q); 4) Arginine (R), Lysine (K); 5) Isoleucine (I), Leucine (L),
Methionine (M), Valine (V); 6) Phenylalanine (F), Tyrosine (Y), Tryptophan (W); 7) Serine
(S), Threonine (T); and 8) Cysteine (C), Methionine (M). See, e.g., Creighton (1984) 
Proteins: Structure and Molecular Properties Freeman. 
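
As a non-limiting illustration, the eight substitution groups listed above can be encoded as sets and used to test whether two aligned peptides differ only by conservative substitutions. The Python sketch below assumes equal-length, already aligned sequences in one-letter code; the names CONSERVATIVE_GROUPS, is_conservative, and differs_only_conservatively are hypothetical.

```python
# The eight groups of typically conservative substitutions, one-letter codes.
CONSERVATIVE_GROUPS = [
    set("AG"),    # 1) Alanine, Glycine
    set("DE"),    # 2) Aspartic acid, Glutamic acid
    set("NQ"),    # 3) Asparagine, Glutamine
    set("RK"),    # 4) Arginine, Lysine
    set("ILMV"),  # 5) Isoleucine, Leucine, Methionine, Valine
    set("FYW"),   # 6) Phenylalanine, Tyrosine, Tryptophan
    set("ST"),    # 7) Serine, Threonine
    set("CM"),    # 8) Cysteine, Methionine
]


def is_conservative(a, b):
    """True if residues a and b differ and share at least one group."""
    return a != b and any(a in g and b in g for g in CONSERVATIVE_GROUPS)


def differs_only_conservatively(seq1, seq2):
    """True if every mismatched position in two aligned, equal-length
    peptides is a conservative substitution."""
    if len(seq1) != len(seq2):
        return False
    return all(x == y or is_conservative(x, y) for x, y in zip(seq1, seq2))


print(differs_only_conservatively("MKDL", "MREI"))
```

Methionine appears in two groups, so the check tests membership in any shared group rather than assigning each residue to a single class.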

Macromolecular structures such as polypeptide structures can be described in terms of 
various levels of organization. See, e.g., Alberts, et al. (eds. 2001) Molecular Biology of the
Cell (4th ed.) Garland; and Cantor and Schimmel (1980) Biophysical Chemistry Part I: The
Conformation of Biological Macromolecules Freeman. "Primary structure" refers to the
amino acid sequence of a particular peptide. "Secondary structure" refers to locally ordered,
three dimensional structures within a polypeptide. These structures are commonly known as
domains, which are portions of a polypeptide that often form a compact unit of the
polypeptide, and are typically about 25-500 amino acids long. Typical domains are made of
sections of lesser organization such as stretches of β-sheet and α-helices. "Tertiary structure"
refers to the complete three dimensional structure of a polypeptide monomer. "Quaternary
structure" refers to the three dimensional structure formed, usually by the noncovalent
association of independent tertiary units. Anisotropic terms are also known as energy terms. 

"Nucleic acid" or "oligonucleotide" or "polynucleotide" or grammatical equivalents
used herein means at least two nucleotides covalently linked together. Oligonucleotides are
typically from about 5, 6, 7, 8, 9, 10, 12, 15, 25, 30, 40, 50 or more nucleotides in length, up
to about 100 nucleotides in length. Nucleic acids and polynucleotides are polymers,
including longer lengths, e.g., 200, 300, 500, 1000, 2000, 3000, 5000, 7000, 10,000, etc. A
nucleic acid of the present invention will generally contain phosphodiester bonds. In some
cases, nucleic acid analogs are included that may have alternate backbones, e.g., 
phosphoramidate (Beaucage, et al. (1993) Tetrahedron 49:1925-963 and references therein; 
Letsinger (1970) J. Org. Chem. 35:3800-803; Sprinzl, et al. (1977) Eur. J. Biochem. 81:579-589;
Letsinger, et al. (1986) Nucl. Acids Res. 14:3487-499; Sawai, et al. (1984) Chem. Lett.
805; Letsinger, et al. (1988) J. Am. Chem. Soc. 110:4470-471; and Pauwels, et al. (1986)
Chemica Scripta 26:141-149); phosphorothioate (Mag, et al. (1991), Nucleic Acids Res.
19:1437-441; and U.S. Patent No. 5,644,048); phosphorodithioate (Brill, et al. (1989) J. Am.
Chem. Soc. 111:2321-322); O-methylphosphoroamidite linkages (see Eckstein (1992)
Oligonucleotides and Analogues: A Practical Approach Oxford Univ. Press); and peptide
nucleic acid backbones and linkages (see Egholm (1992) J. Am. Chem. Soc. 114:1895-897;
Meier, et al. (1992) Chem. Int. Ed. Engl. 31:1008-010; Nielsen (1993) Nature 365:566-568;
Carlsson, et al. (1996) Nature 380:207). Other analog nucleic acids include those with
positively charged backbones (Denpcy, et al. (1995) Proc. Natl. Acad. Sci. USA 92:6097-101);
non-ionic backbones (U.S. Patent Nos. 5,386,023; 5,637,684; 5,602,240; 5,216,141;
and 4,469,863; Kiedrowshi, et al. (1991) Angew. Chem. Intl. Ed. English 30:423-426;
Letsinger, et al. (1988) J. Am. Chem. Soc. 110:4470-471; Jung, et al. (1994) Nucleoside and
Nucleotide 13:1597-xxx; Chapters 2-3 of Sanghvi and Cook (eds. 1994) Carbohydrate 
Modifications in Antisense Research ACS Symposium Series 580; Mesmaeker, et al. (1994) 
Bioorganic and Medicinal Chem. Lett. 4:395-398; Jeffs, et al. (1994) J. Biomolecular NMR 
34:17; Horn, et al. (1996) Tetrahedron Lett. 37:743-xxx); and non-ribose backbones (see U.S.
Patent Nos. 5,235,033 and 5,034,506, and Chapters 6-7 of Sanghvi and Cook (eds. 1994)
Carbohydrate Modifications in Antisense Research ACS Symposium Series 580). Nucleic
acids containing one or more carbocyclic sugars are also contemplated. See Jenkins and
Turner (1995) Chem. Soc. Rev. 24:169-176. Several nucleic acid analogs are described in
Rawls (page 35, June 2, 1997) C&E News. Modifications of the ribose-phosphate backbone
may be made, e.g., to increase the stability and half-life of such molecules in physiological 
environments or as probes on a biochip. Mixtures of naturally occurring nucleic acids and
analogs can be made; alternatively, mixtures of different nucleic acid analogs may be made.

Particularly preferred are peptide nucleic acids (PNA) which include peptide nucleic
acid analogs. These backbones are substantially non-ionic under neutral conditions, in
contrast to the highly charged phosphodiester backbone of naturally occurring nucleic acids.
The PNA backbone typically exhibits improved hybridization kinetics, exhibiting larger
changes in the melting temperature (Tm) for mismatched versus perfectly matched basepairs.
DNA and RNA typically exhibit a 2-4° C drop in Tm for an internal mismatch. With the
non-ionic PNA backbone, the drop is closer to 7-9° C. And due to their non-ionic nature,
hybridization of the polymers is relatively insensitive to salt concentration. In addition, 
PNAs are not as easily degraded by cellular enzymes, and can be more stable. 

The nucleic acids may be single stranded or double stranded, or contain portions of
both double stranded and single stranded sequence. As will be appreciated by those in the art,
the depiction of a single strand also defines the sequence of the complementary strand; thus 
the sequences described herein also provide the complement of the sequence. The nucleic 
acid may be DNA, both genomic and cDNA, RNA, or a hybrid, where the nucleic acid may 
contain combinations of deoxyribo- and ribo-nucleotides, and combinations of bases,
including uracil, adenine, thymine, cytosine, guanine, inosine, xanthine, hypoxanthine,
isocytosine, isoguanine, etc. "Transcript" typically refers to a naturally occurring RNA, e.g.,
a pre-mRNA, hnRNA, or mRNA. As used herein, the term "nucleoside" includes nucleotides
and nucleoside and nucleotide analogs, and modified nucleosides such as amino modified
nucleosides. In addition, "nucleoside" includes non-naturally occurring analog structures.
Thus, e.g., the individual units of a peptide nucleic acid, each containing a base, are referred
to herein as a nucleoside. 
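
The statement that the depiction of a single strand also defines the complementary strand can be illustrated with a short Python sketch. It assumes a DNA sequence written 5' to 3' using only the four standard bases; the function name reverse_complement is hypothetical and the sketch is offered for illustration only.

```python
# Watson-Crick pairing for the four standard DNA bases (upper and lower case).
COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(dna):
    """Return the complementary strand, written 5' to 3'."""
    return dna.translate(COMPLEMENT)[::-1]


print(reverse_complement("ATGC"))
```

Applying the function twice returns the original sequence, which reflects the fact that either strand fully specifies the other.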

A "label" or "detectable moiety" is a composition detectable by spectroscopic, 
photochemical, biochemical, immunochemical, chemical, or other physical means. Direct or 
indirect methods are contemplated. For example, useful labels include 32P, fluorescent
dyes, electron-dense reagents, enzymes (e.g., as commonly used in an ELISA), biotin,
digoxigenin, or haptens and proteins or other entities which can be made detectable, e.g., by
incorporating a radiolabel into the peptide or used to detect antibodies specifically reactive
with the peptide. The labels may be incorporated into the bladder cancer nucleic acids,
proteins, and antibodies. Methods are well known for conjugating the antibody to the label,
including those methods described by Hunter, et al. (1962) Nature 144:945-946; David, et al.
(1974) Biochemistry 13:1014-021; Pain, et al. (1981) J. Immunol. Meth. 40:219-230; and
Nygren (1982) J. Histochem. and Cytochem. 30:407-412.

An "effector" or "effector moiety" or "effector component" is a molecule that is bound 
(or linked, or conjugated), either covalently, through a linker or a chemical bond, or
noncovalently, through ionic, van der Waals, electrostatic, or hydrogen bonds, to a target,
e.g., an antibody. The "effector" can be a variety of molecules including, e.g., detection 
moieties including radioactive compounds; fluorescent compounds; an enzyme or substrate; 
tags such as epitope tags; a toxin; activatable moieties; a chemotherapeutic agent; a lipase; an 
antibiotic; a radioisotope emitting "hard", e.g., beta radiation; or an attracting moiety. 

A "labeled nucleic acid probe or oligonucleotide" is one that is bound, either
covalently, through a linker or a chemical bond, or noncovalently, through ionic, van der
Waals, electrostatic, or hydrogen bonds to a label such that the presence of the probe may be
detected by detecting the presence of the label bound to the probe. Alternatively, methods
using high affinity interactions may achieve the same results where one of a pair of binding
partners binds to the other, e.g., biotin and streptavidin.

As used herein a "nucleic acid probe or oligonucleotide" is defined as a nucleic acid
capable of binding to a target nucleic acid of complementary sequence through one or more
types of chemical bonds, usually through complementary base pairing, usually through
hydrogen bond formation. As used herein, a probe may include natural (e.g., A, G, C, or T)
or modified bases (7-deazaguanosine, inosine, etc.). In addition, the bases in a probe may be
joined by a linkage other than a phosphodiester bond, so long as it does not functionally 
interfere with hybridization. Thus, e.g., probes may be peptide nucleic acids in which the 
constituent bases are joined by peptide bonds rather than phosphodiester linkages. Probes 
may bind target sequences lacking complete complementarity with the probe sequence
depending upon the stringency of the hybridization conditions. The probes are preferably
directly labeled as with isotopes, chromophores, lumiphores, chromogens, or indirectly
labeled such as with biotin to which a streptavidin-linked label may bind. By assaying for the
presence or absence of the probe, one can detect the presence or absence of the select
sequence or subsequence. Diagnosis or prognosis may be based at the genomic level, or at
the level of RNA or protein expression.

The term "recombinant" when used with reference, e.g., to a cell, or nucleic acid, 
protein, or vector, indicates that the cell, nucleic acid, protein or vector, has been modified by 
the introduction of a heterologous nucleic acid or protein or the alteration of a native nucleic 
acid or protein, or that the cell is derived from a cell so modified. Thus, e.g., recombinant
cells express genes that are not found within the native (non-recombinant) form of the cell or
express native genes that are otherwise abnormally expressed, under expressed, or not 
expressed at all. By the term "recombinant nucleic acid" herein is meant nucleic acid, 
originally formed in vitro, in general, by the manipulation of nucleic acid, e.g., using 
polymerases and endonucleases, in a form not normally found in nature. In this manner,
operable linkage of different sequences is achieved. Thus an isolated nucleic acid, in a linear
form, or an expression vector formed in vitro by ligating DNA molecules that are not 
normally joined, are both considered recombinant for the purposes of this invention. It is 
understood that once a recombinant nucleic acid is made and reintroduced into a host cell or 
organism, it will replicate non-recombinantly, e.g., using in vivo cellular machinery of the
host cell rather than in vitro manipulations; however, such nucleic acids, once produced
recombinantly, although subsequently replicated non-recombinantly, are still considered
recombinant for the purposes of the invention. Similarly, a "recombinant protein" is a protein
made using recombinant techniques, e.g., through the expression of a recombinant nucleic
acid as depicted above.

WO 03/003906 PCT/US02/21338

The term "heterologous" when used with reference to portions of a nucleic acid
indicates that the nucleic acid comprises two or more subsequences that are not normally
found in the same relationship to each other in nature. For instance, the nucleic acid is
typically recombinantly produced, having two or more sequences, e.g., from unrelated genes
arranged to make a new functional nucleic acid, e.g., a promoter from one source and a
coding region from another source. Similarly, a heterologous protein will often refer to two
or more subsequences that are not found in the same relationship to each other in nature (e.g., 
a fusion protein). 

A "promoter" is defined as an array of nucleic acid control sequences that direct 
transcription of a nucleic acid. As used herein, a promoter includes necessary nucleic acid
sequences near the start site of transcription, such as, in the case of a polymerase II type
promoter, a TATA element. A promoter also optionally includes distal enhancer or repressor
elements, which can be located as much as several thousand base pairs from the start site of 
transcription. A "constitutive" promoter is a promoter that is active under most 
environmental and developmental conditions. An "inducible" promoter is a promoter that is
active under environmental or developmental regulation. The term "operably linked" refers
to a functional linkage between a nucleic acid expression control sequence (such as a 
promoter, or array of transcription factor binding sites) and a second nucleic acid sequence, 
wherein the expression control sequence directs transcription of the nucleic acid 
corresponding to the second sequence. 

An "expression vector" is a nucleic acid construct, generated recombinantly or
synthetically, with a series of specified nucleic acid elements that permit transcription of a
particular nucleic acid in a host cell. The expression vector can be part of a plasmid, virus, or 
nucleic acid fragment. Typically, the expression vector includes a nucleic acid to be 
transcribed operably linked to a promoter. 

The phrase "selectively (or specifically) hybridizes to" refers to the binding,
duplexing, or hybridizing of a molecule only to a particular nucleotide sequence under
stringent hybridization conditions when that sequence is present in a complex mixture (e.g.,
total cellular or library DNA or RNA).

The phrase "stringent hybridization conditions" refers to conditions under which a
probe will hybridize to its target subsequence, typically in a complex mixture of nucleic
acids, but to no other sequences. Stringent conditions are sequence-dependent and will be 
different in different circumstances. Longer sequences hybridize specifically at higher 
temperatures. An extensive guide to the hybridization of nucleic acids is found in "Overview
of principles of hybridization and the strategy of nucleic acid assays" in Tijssen (1993)
Hybridization with Nucleic Probes (Techniques in Biochemistry and Molecular Biology; vol.
24) Elsevier. Generally, stringent conditions are selected to be about 5-10° C lower than the
thermal melting point (Tm) for the specific sequence at a defined ionic strength and pH. The Tm
is the temperature (under defined ionic strength, pH, and nucleic acid concentration) at which
50% of the probes complementary to the target hybridize to the target sequence at
equilibrium (as the target sequences are present in excess, at Tm, 50% of the probes are
occupied at equilibrium). Stringent conditions will be those in which the salt concentration is
less than about 1.0 M sodium ion, typically about 0.01 to 1.0 M sodium ion concentration (or
other salts) at pH 7.0 to 8.3 and the temperature is at least about 30° C for short probes (e.g.,
about 10-50 nucleotides) and at least about 60° C for long probes (e.g., greater than about 50
nucleotides). Stringent conditions may also be achieved with the addition of destabilizing
agents such as formamide. For selective or specific hybridization, a positive signal is at least
about two times background, preferably about 10 times background hybridization.
Exemplary stringent hybridization conditions can be as follows: 50% formamide, 5x SSC,
and 1% SDS, incubating at 42° C, or, 5x SSC, 1% SDS, incubating at 65° C, with wash in
0.2x SSC, and 0.1% SDS at 65° C. For PCR, a temperature of about 36° C is typical for low
stringency amplification, although annealing temperatures may vary between about 32-48° C
depending on primer length. For high stringency PCR amplification, a temperature of about
62° C is typical, although high stringency annealing temperatures can range from about
50-65° C, depending on the primer length and specificity. Typical cycle conditions for both high
and low stringency amplifications include a denaturation phase of 90-95° C for 30-120 sec,
an annealing phase lasting 30-120 sec, and an extension phase of about 72° C for 1-2 min.
Protocols and guidelines for low and high stringency amplification reactions are provided,
e.g., in Innis, et al. (1990) PCR Protocols, A Guide to Methods and Applications Academic
Press NY. 
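
For illustration only, two of the numerical criteria stated above can be expressed as a short Python sketch: the hybridization temperature window of about 5-10° C below the Tm, and the requirement that a positive signal be at least about two times background. The function names are hypothetical, and treating the word "about" as exact bounds is a simplifying assumption of the sketch, not a limitation of the definitions.

```python
def stringent_temperature_window(tm_celsius):
    """Return (low, high) hybridization temperatures in degrees C,
    taken as 10 and 5 degrees below the supplied Tm."""
    return (tm_celsius - 10.0, tm_celsius - 5.0)


def is_positive_signal(signal, background, fold=2.0):
    """True if the signal is at least `fold` times background.

    fold=2.0 reflects the stated minimum; fold=10.0 reflects the
    stated preferred level.
    """
    if background <= 0:
        raise ValueError("background must be positive")
    return signal >= fold * background


print(stringent_temperature_window(72.0))
print(is_positive_signal(250.0, 100.0))
```

The Tm itself is an input here; the sketch does not estimate it from sequence, salt concentration, or probe length.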

Nucleic acids that do not hybridize to each other under stringent conditions are still 
substantially identical if the polypeptides which they encode are substantially identical. This 
occurs, e.g., when a copy of a nucleic acid is created using the maximum codon degeneracy
permitted by the genetic code. In such cases, the nucleic acids typically hybridize under
moderately stringent hybridization conditions. Exemplary "moderately stringent
hybridization conditions" include a hybridization in a buffer of 40% formamide, 1 M NaCl,
1% SDS at 37° C, and a wash in 1x SSC at 45° C. A positive hybridization is at least about
twice background. Alternative hybridization and wash conditions can be utilized to provide
conditions of similar stringency. Additional guidelines for determining hybridization
parameters are provided in numerous references, e.g., Ausubel, et al. Current Protocols in
Molecular Biology Lippincott. 

The phrase "functional effects" in the context of assays for testing compounds that 
modulate activity of a bladder cancer protein includes the determination of a parameter that is
indirectly or directly under the influence of the bladder cancer protein or nucleic acid, e.g., a
functional, physical, or chemical effect, such as the ability to decrease bladder cancer. It 
includes ligand binding activity; cell growth on soft agar; anchorage dependence; contact 
inhibition and density limitation of growth; cell viability, cellular proliferation; cellular 
transformation; growth factor or serum dependence; tumor specific marker levels;
invasiveness into Matrigel; tumor growth and metastasis in vivo; mRNA and protein
expression in cells undergoing metastasis, and other characteristics of bladder cancer cells.
"Functional effects" include in vitro, in vivo, and ex vivo activities. 

By "determining the functional effect" is meant assaying for a compound that 
increases or decreases a parameter that is indirectly or directly under the influence of a
bladder cancer protein sequence, e.g., functional, enzymatic, physical and chemical effects.
Such functional effects can be measured by many means known to those skilled in the art,
e.g., changes in spectroscopic characteristics (e.g., fluorescence, absorbance, refractive
index), hydrodynamic (e.g., shape), chromatographic, or solubility properties for the protein,
measuring inducible markers or transcriptional activation of the bladder cancer protein;
measuring binding activity or binding assays, e.g., binding to antibodies or other ligands, and
measuring cellular proliferation or metabolism. Determination of the functional effect of a 
compound on bladder cancer can also be performed using bladder cancer assays, such as, in 
vitro assays, e.g., cell growth on soft agar; anchorage dependence; contact inhibition and 
density limitation of growth; cellular proliferation; cellular transformation; growth factor or
serum dependence; tumor specific marker levels; invasiveness into Matrigel; tumor growth
and metastasis in vivo; mRNA and protein expression in cells undergoing metastasis, and 
other characteristics of bladder cancer cells. Functional effects can be evaluated by many 
means, e.g., microscopy for quantitative or qualitative measures of alterations in 
morphological features, measurement of changes in RNA or protein levels for bladder
cancer-associated sequences, measurement of RNA stability, identification of downstream or
reporter gene expression (CAT, luciferase, β-gal, GFP, and the like), e.g., via
chemiluminescence, fluorescence, colorimetric reactions, antibody binding, inducible
markers, and ligand binding assays.

"Inhibitors", "activators", and "modulators" of bladder cancer polynucleotide and
polypeptide sequences are used to refer to activating, inhibitory, or modulating molecules or
compounds identified using in vitro and in vivo assays of bladder cancer polynucleotide and 
polypeptide sequences. Inhibitors are compounds that, e.g., bind to, partially or totally block 
activity, decrease, prevent, delay activation, inactivate, desensitize, or down regulate the 
activity or expression of bladder cancer proteins, e.g., antagonists. Antisense nucleic acids
may seem to inhibit expression and subsequent function of the protein. "Activators" are
compounds that increase, open, activate, facilitate, enhance activation, sensitize, agonize, or 
up regulate bladder cancer protein activity. Inhibitors, activators, or modulators also include 
genetically modified versions of bladder cancer proteins, e.g., versions with altered activity, 
as well as naturally occurring and synthetic ligands, antagonists, agonists, antibodies, small
chemical molecules and the like. Such assays for inhibitors and activators include, e.g.,
expressing the bladder cancer protein in vitro, in cells, or cell membranes, applying putative
modulator compounds, and then determining the functional effects on activity, as described
above. Activators and inhibitors of bladder cancer can also be identified by incubating
bladder cancer cells with the test compound and determining increases or decreases in the
expression of 1 or more bladder cancer proteins, e.g., 1, 2, 3, 4, 5, 10, 15, 20, 25, 30, 40, 50 or
more bladder cancer proteins, such as bladder cancer proteins encoded by the sequences set
out in Tables 1A-13. 

Samples or assays comprising bladder cancer proteins that are treated with a potential 
activator, inhibitor, or modulator are compared to control samples without the inhibitor,
activator, or modulator to examine the extent of inhibition. Control samples (untreated with
inhibitors) are assigned a relative protein activity value of 100%. Inhibition of a polypeptide
is achieved when the activity value relative to the control is about 80%, preferably about
50%, more preferably about 25-0%. Activation of a bladder cancer polypeptide is achieved
when the activity value relative to the control (untreated with activators) is about 110%, more
preferably about 150%, more preferably about 200-500% (e.g., two to five fold higher
relative to the control), more preferably about 1000-3000% higher. 
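
The comparison to an untreated control described above can be sketched in Python as follows. The sketch is illustrative only: the function names relative_activity and classify_modulation are hypothetical, and the use of 80% and 110% as sharp cutoffs is an assumption made for the example, since the values above are stated as approximate.

```python
def relative_activity(treated, control):
    """Activity of a treated sample as a percentage of the untreated
    control, which is assigned a value of 100%."""
    if control <= 0:
        raise ValueError("control activity must be positive")
    return 100.0 * treated / control


def classify_modulation(percent_of_control):
    """Classify a compound by the relative activity it produces.

    Assumed cutoffs: at or below 80% counts as inhibition,
    at or above 110% counts as activation.
    """
    if percent_of_control <= 80.0:
        return "inhibition"
    if percent_of_control >= 110.0:
        return "activation"
    return "no clear effect"


pct = relative_activity(treated=40.0, control=80.0)
print(pct, classify_modulation(pct))
```

A practical assay would also account for replicate variability before assigning a compound to either class; that step is outside the scope of this sketch.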

The phrase "changes in cell growth" refers to a change in cell growth and proliferation 
characteristics in vitro or in vivo, such as cell viability, formation of foci, anchorage 
independence, semi-solid or soft agar growth, changes in contact inhibition and density
limitation of growth, loss of growth factor or serum requirements, changes in cell
morphology, gaining or losing immortalization, gaining or losing tumor specific markers,
ability to form or suppress tumors when injected into suitable animal hosts, and/or 
immortalization of the cell. See, e.g., pp. 231-241 of Freshney (1994) Culture of Animal 
Cells: A Manual of Basic Technique (3d ed.). 

"Tumor cell" refers to precancerous, cancerous, and normal cells in a tumor.

"Cancer cells," "transformed" cells or "transformation" in tissue culture, refers to 
spontaneous or induced phenotypic changes that do not necessarily involve the uptake of new 
genetic material. Although transformation can arise from infection with a transforming virus 
and incorporation of new genomic DNA, or uptake of exogenous DNA, it can also arise
spontaneously or following exposure to a carcinogen, thereby mutating an endogenous gene.
Transformation is associated with phenotypic changes, such as immortalization of cells,
aberrant growth control, nonmorphological changes, and/or malignancy. See, Freshney
(2000) Culture of Animal Cells: A Manual of Basic Technique (4th ed.) Wiley-Liss.

"Antibody" refers to a polypeptide comprising a framework region from an
immunoglobulin gene or fragments thereof that specifically binds and recognizes an antigen.
The recognized immunoglobulin genes include the kappa, lambda, alpha, gamma, delta, 
epsilon, and mu constant region genes, as well as the myriad immunoglobulin variable region 
genes. Light chains are classified as either kappa or lambda. Heavy chains are classified as 
gamma, mu, alpha, delta, or epsilon, which in turn define the immunoglobulin classes, IgG,
IgM, IgA, IgD and IgE, respectively. Typically, the antigen-binding region of an antibody or
its functional equivalent will be most critical in specificity and affinity of binding. See Paul 
(ed. 1999) Fundamental Immunology (4th ed.) Raven. 
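
The correspondence between heavy chain type and immunoglobulin class stated above can be written as a simple lookup. The Python sketch below is illustrative only, and the names HEAVY_CHAIN_TO_CLASS, LIGHT_CHAIN_TYPES, and immunoglobulin_class are hypothetical.

```python
# Heavy chain type -> immunoglobulin class, as stated in the text.
HEAVY_CHAIN_TO_CLASS = {
    "gamma": "IgG",
    "mu": "IgM",
    "alpha": "IgA",
    "delta": "IgD",
    "epsilon": "IgE",
}

# Light chains are classified as either kappa or lambda.
LIGHT_CHAIN_TYPES = {"kappa", "lambda"}


def immunoglobulin_class(heavy_chain):
    """Return the immunoglobulin class defined by a heavy chain type."""
    key = heavy_chain.lower()
    if key not in HEAVY_CHAIN_TO_CLASS:
        raise ValueError(f"unknown heavy chain type: {heavy_chain!r}")
    return HEAVY_CHAIN_TO_CLASS[key]


print(immunoglobulin_class("gamma"))
```

The light chain type does not enter the lookup because, as stated above, the class is defined by the heavy chain.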

An exemplary immunoglobulin (antibody) structural unit comprises a tetramer. Each 
tetramer is composed of two identical pairs of polypeptide chains, each pair having one
"light" (about 25 kD) and one "heavy" chain (about 50-70 kD). The N-terminus of each
chain defines a variable region of about 100 to 110 or more amino acids primarily responsible
for antigen recognition. The terms variable light chain (VL) and variable heavy chain (VH)
refer to these light and heavy chains respectively.

Antibodies exist, e.g., as intact immunoglobulins or as a number of well-characterized
fragments produced by digestion with various peptidases. Thus, e.g., pepsin digests an
antibody below the disulfide linkages in the hinge region to produce F(ab)'2, a dimer of Fab
which itself is a light chain joined to VH-CH1 by a disulfide bond. The F(ab)'2 may be
reduced under mild conditions to break the disulfide linkage in the hinge region, thereby
converting the F(ab)'2 dimer into an Fab' monomer. The Fab' monomer is essentially Fab
with part of the hinge region. See Paul (ed. 1999) Fundamental Immunology (4th ed.) Raven.
While various antibody fragments are defined in terms of the digestion of an intact antibody, 
such fragments may be synthesized de novo either chemically or by using recombinant DNA 
methodology. Thus, the term antibody, as used herein, also includes antibody fragments 
either produced by the modification of whole antibodies, or those synthesized de novo using
recombinant DNA methodologies (e.g., single chain Fv) or those identified using phage
display libraries. See, e.g., McCafferty, et al. (1990) Nature 348:552-554. 

For preparation of antibodies, e.g., recombinant, monoclonal, or polyclonal
antibodies, many techniques can be used. See, e.g., Kohler and Milstein (1975) Nature
256:495-497; Kozbor, et al. (1983) Immunology Today 4:72; Cole, et al. pp. 77-96 in
Reisfeld and Sell (1985) Monoclonal Antibodies and Cancer Therapy Liss; Coligan (1991)
Current Protocols in Immunology Lippincott; Harlow and Lane (1988) Antibodies: A
Laboratory Manual CSH Press; and Goding (1986) Monoclonal Antibodies: Principles and
Practice (2d ed.) Academic Press. Techniques for the production of single chain antibodies
(U.S. Patent 4,946,778) can be adapted to produce antibodies to polypeptides of this
invention. Also, transgenic mice, or other organisms such as other mammals, may be used to
express humanized antibodies. Alternatively, phage display technology can be used to
identify antibodies and heteromeric Fab fragments that specifically bind to selected antigens.
See, e.g., McCafferty, et al. (1990) Nature 348:552-554; and Marks, et al. (1992)
Biotechnology 10:779-783.

A "chimeric antibody" is an antibody molecule in which (a) the constant region, or a 
portion thereof, is altered, replaced, or exchanged so that the antigen binding site (variable 
region) is linked to a constant region of a different or altered class, effector function, and/or 
species, or an entirely different molecule which confers new properties to the chimeric
antibody, e.g., an enzyme, toxin, hormone, growth factor, drug, etc.; or (b) the variable
region, or a portion thereof, is altered, replaced, or exchanged with a variable region having a
different or altered antigen specificity. 

Identification of bladder cancer-associated sequences 

In one aspect, the expression levels of genes are determined in different patient
samples for which diagnosis information is desired, to provide expression profiles. An
expression profile of a particular sample is essentially a "fingerprint" of the state of the
sample; while two states may have a particular gene similarly expressed, the evaluation of a
number of genes simultaneously allows the generation of a gene expression profile that is
characteristic of the state of the cell. That is, normal tissue (e.g., normal bladder or other
tissue) may be distinguished from cancerous or metastatic cancerous tissue of the bladder, or
bladder cancer tissue or metastatic bladder cancerous tissue can be compared with tissue
samples of bladder and other tissues from surviving cancer patients. By comparing
expression profiles of tissue in known different bladder cancer states, information regarding
which genes are important (including both up- and down-regulation of genes) in each of these
states is obtained.

The identification of sequences that are differentially expressed in bladder cancer
versus non-bladder cancer tissue allows the use of this information in a number of ways. For
example, a particular treatment regime may be evaluated: does a chemotherapeutic drug act
to down-regulate bladder cancer, and thus tumor growth or recurrence, in a particular patient;
or does chemotherapy or radiation therapy induce expression of particular targets. Similarly,
diagnosis and treatment outcomes may be done or confirmed by comparing patient samples
with the known expression profiles. Metastatic tissue can also be analyzed to determine the
stage of bladder cancer in the tissue or origin of a primary tumor. Furthermore, these gene
expression profiles (or individual genes) allow screening of drug candidates with an eye to
mimicking or altering a particular expression profile; e.g., screening can be done for drugs
that suppress the bladder cancer expression profile. This may be done by making biochips
comprising sets of important bladder cancer genes, which can then be used in these screens.
These methods can also be applied on the protein basis; that is, protein expression levels of
the bladder cancer proteins can be evaluated for diagnostic purposes or to screen candidate
agents. In addition, the bladder cancer nucleic acid sequences can be administered for gene
therapy purposes, including the administration of antisense or inhibitory nucleic acids, or the
bladder cancer proteins (including antibodies and other modulators thereof) administered as
therapeutic drugs.

Thus the present invention provides nucleic acid and protein sequences that are
differentially expressed in bladder disease or cancer relative to normal tissues and/or
non-malignant bladder tissue, herein termed "bladder cancer sequences." As outlined below,
bladder cancer sequences include those that are up-regulated (e.g., expressed at a higher
level) in bladder cancer, as well as those that are down-regulated (e.g., expressed at a lower
level). In a preferred embodiment, the bladder cancer sequences are from humans; however,
as will be appreciated by those in the art, bladder cancer sequences from other organisms may
be useful in animal models of disease and drug evaluation; thus, other bladder cancer
sequences are provided, from vertebrates, including mammals, including rodents (rats, mice,
hamsters, guinea pigs, etc.), primates, farm animals (including sheep, goats, pigs, cows,
horses, etc.) and pets (e.g., dogs, cats, etc.). Bladder cancer sequences from other organisms
may be obtained using the techniques outlined below.

Bladder cancer sequences can include both nucleic acid and amino acid sequences. 
Bladder cancer nucleic acid sequences are useful in a variety of applications, including 
diagnostic applications, which will detect naturally occurring nucleic acids, as well as
screening applications. Biochips comprising nucleic acid probes or PCR microtiter plates
with selected probes to the bladder cancer sequences can be generated. 

A bladder cancer sequence can be initially identified by substantial nucleic acid 
and/or amino acid sequence homology to the bladder cancer sequences outlined herein. Such 
homology can be based upon the overall nucleic acid or amino acid sequence, and is
generally determined as outlined below, using either homology programs or hybridization
conditions. 

For identifying bladder cancer-associated sequences, the bladder cancer screen
typically includes comparing genes identified in different tissues, e.g., normal,
non-malignant, or cancerous tissues, or tumor tissue samples from patients who have metastatic
disease vs. non-metastatic tissue. Other suitable tissue comparisons include comparing
bladder cancer samples with metastatic cancer samples from other cancers, such as lung,
bladder, gastrointestinal cancers, ovarian, etc. Samples of different stages of bladder cancer,
e.g., survivor tissue, drug resistant states, and tissue undergoing metastasis, are applied to
biochips comprising nucleic acid probes. The samples are first microdissected, if applicable,
and treated for the preparation of mRNA. Suitable biochips are commercially available, e.g.,
from Affymetrix. Gene expression profiles as described herein are generated and the data
analyzed.

In one embodiment, genes showing changes in expression as between normal and
disease states are compared to genes expressed in other normal tissues, preferably normal
bladder, but also including, and not limited to lung, heart, brain, liver, bladder, kidney,
muscle, colon, small intestine, large intestine, spleen, bone, and placenta. In a preferred
embodiment, those genes identified during the bladder cancer screen that are expressed in a
significant amount in other tissues are removed from the profile, although in some
embodiments, this is not necessary. That is, when screening for drugs, it is usually preferable
that the target be disease specific, e.g., not be expressed on critical organs.

In a preferred embodiment, bladder cancer sequences are those that are up-regulated
in bladder cancer; that is, the expression of these genes is higher in the bladder cancer tissue
as compared to non-cancerous tissue. "Up-regulation" as used herein often means at least
about a two-fold change, preferably at least about a three-fold change, with at least about
five-fold or higher being preferred. Unigene cluster identification numbers and accession
numbers herein are for the GenBank sequence database and sequences of accession numbers
are hereby expressly incorporated by reference. GenBank is known in the art, see, e.g.,
Benson, et al. (1998) Nuc. Acids Res. 26:1-7 and http://www.ncbi.nlm.nih.gov/. Sequences
are also available in other databases, e.g., European Molecular Biology Laboratory (EMBL)
and DNA Database of Japan (DDBJ).

In another preferred embodiment, bladder cancer sequences are those that are
down-regulated in the bladder cancer; that is, the expression of these genes is lower in bladder
cancer tissue as compared to non-cancerous tissue (see, e.g., Tables 1A-13).
"Down-regulation" as used herein often means at least about a two-fold change, preferably at least
about a three-fold change, with at least about five-fold or higher being preferred.
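
By way of illustration only, the fold-change thresholds described above can be sketched in
code. The function name, the use of a simple tumor/normal ratio, and the default threshold
of 2.0 are assumptions made for this example; the specification does not prescribe any
particular computation.

```python
def classify_regulation(tumor_level, normal_level, fold_threshold=2.0):
    """Label a gene 'up', 'down', or 'unchanged' from two expression levels.

    Hypothetical helper: a gene is called up-regulated when tumor/normal is at
    least fold_threshold, and down-regulated when it is at most 1/fold_threshold.
    """
    if tumor_level <= 0 or normal_level <= 0:
        raise ValueError("expression levels must be positive")
    ratio = tumor_level / normal_level
    if ratio >= fold_threshold:
        return "up"
    if ratio <= 1.0 / fold_threshold:
        return "down"
    return "unchanged"


if __name__ == "__main__":
    # Two-fold default, then the stricter five-fold threshold.
    print(classify_regulation(300.0, 100.0))
    print(classify_regulation(300.0, 100.0, fold_threshold=5.0))
```

Raising fold_threshold to 3.0 or 5.0 corresponds to the more preferred three-fold and
five-fold criteria.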

Informatics 

The ability to identify genes that are over- or under-expressed in bladder cancer can
additionally provide high-resolution, high-sensitivity datasets which can be used in the areas
of diagnostics, therapeutics, drug development, pharmacogenetics, protein structure,
biosensor development, and other related areas. For example, the expression profiles can be
used in diagnostic or prognostic evaluation of patients with bladder cancer. Or as another
example, subcellular toxicological information can be generated to better direct drug structure
and activity correlation. See Anderson (June 11-12, 1998) Pharmaceutical Proteomics:
Targets, Mechanism, and Function, paper presented at the IBC Proteomics conference,
Coronado, CA. Subcellular toxicological information can also be utilized in a biological
sensor device to predict the likely toxicological effect of chemical exposures and likely
tolerable exposure thresholds (see U.S. Patent No. 5,811,231). Similar advantages accrue
from datasets relevant to other biomolecules and bioactive agents (e.g., nucleic acids,
saccharides, lipids, drugs, and the like).

Thus, in another embodiment, the present invention provides a database that includes 
at least one set of assay data. The data contained in the database is acquired, e.g., using array 
analysis either singly or in a library format. The database can be in a form in which data can 
be maintained and transmitted, but is preferably an electronic database. The electronic
database of the invention can be maintained on an electronic device allowing for the storage
of and access to the database, such as a personal computer, but is preferably distributed on a 
wide area network, such as the World Wide Web. 

The focus of the present section on databases that include peptide sequence data is for 
clarity of illustration only. It will be apparent that similar databases can be assembled for
assay data acquired using an assay of the invention.

The compositions and methods for identifying and/or quantitating the relative and/or
absolute abundance of a variety of molecular and macromolecular species from a biological
sample undergoing bladder cancer, e.g., the identification of bladder cancer-associated
sequences described herein, provide an abundance of information, which can be correlated
with pathological conditions, predisposition to disease, drug testing, therapeutic monitoring,
gene-disease causal linkages, identification of correlates of immunity and physiological
status, among others. Although the data generated from the assays of the invention is suited
for manual review and analysis, in a preferred embodiment, prior data processing using
high-speed computers is utilized.

An array of methods for indexing and retrieving biomolecular information is known
in the art. For example, U.S. Patents 6,023,659 and 5,966,712 disclose a relational database
system for storing biomolecular sequence information in a manner that allows sequences to
be catalogued and searched according to one or more protein function hierarchies. U.S.
Patent 5,953,727 discloses a relational database having sequence records containing
information in a format that allows a collection of partial-length DNA sequences to be
catalogued and searched according to association with one or more sequencing projects for
obtaining full-length sequences from the collection of partial length sequences. U.S. Patent
5,706,498 discloses a gene database retrieval system for making a retrieval of a gene
sequence similar to a sequence data item in a gene database based on the degree of similarity
between a key sequence and a target sequence. U.S. Patent 5,538,897 discloses a method
using mass spectroscopy fragmentation patterns of peptides to identify amino acid sequences
in computer databases by comparison of predicted mass spectra with experimentally-derived
mass spectra using a closeness-of-fit measure. U.S. Patent 5,926,818 discloses a
multi-dimensional database comprising a functionality for multi-dimensional data analysis
described as on-line analytical processing (OLAP), which entails the consolidation of
projected and actual data according to more than one consolidation path or dimension. U.S.
Patent 5,295,261 reports a hybrid database structure in which the fields of each database
record are divided into two classes, navigational and informational data, with navigational
fields stored in a hierarchical topological map which can be viewed as a tree structure or as
the merger of two or more such tree structures.

See also Mount, et al. (2001) Bioinformatics CSH Press, NY; Durbin, et al. (eds.
1999) Biological Sequence Analysis: Probabilistic Models of Proteins and Nucleic Acids
Cambridge Univ. Press; Baxevanis and Ouellette (eds. 1998) Bioinformatics: A Practical
Guide to the Analysis of Genes and Proteins (2d. ed.) Wiley-Liss; Rashidi and Buehler
(1999) Bioinformatics: Basic Applications in Biological Science and Medicine CRC Press;
Setubal, et al. (eds. 1997) Introduction to Computational Molecular Biology Brooks/Cole;
Misener and Krawetz (eds. 2000) Bioinformatics: Methods and Protocols Oxford Univ.
Press; Higgins and Taylor (eds. 2000) Bioinformatics: Sequence, Structure, and Databanks: A
Practical Approach Oxford Univ. Press; Brown (2001) Bioinformatics: A Biologist's Guide to
Biocomputing and the Internet Eaton Pub.; Han and Kamber (2000) Data Mining: Concepts
and Techniques Kaufmann Pub.; and Waterman (1995) Introduction to Computational
Biology: Maps, Sequences, and Genomes Chapman and Hall.

The present invention provides a computer database comprising a computer and
software for storing in computer-retrievable form assay data records cross-tabulated, e.g.,
with data specifying the source of the target-containing sample from which each sequence
specificity record was obtained.

In an exemplary embodiment, at least one of the sources of target-containing sample
is from a control tissue sample known to be free of pathological disorders. In a variation, at
least one of the sources is a known pathological tissue specimen, e.g., a neoplastic lesion or
another tissue specimen to be analyzed for bladder cancer. In another variation, the assay
records cross-tabulate one or more of the following parameters for each target species in a
sample: (1) a unique identification code, which can include, e.g., a target molecular structure
and/or characteristic separation coordinate (e.g., electrophoretic coordinates); (2) sample
source; and (3) absolute and/or relative quantity of the target species present in the sample.
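
The three cross-tabulated parameters listed above can be pictured as a simple record
structure. The following is a minimal sketch only; the class name, field names, and the
nested-dictionary layout are assumptions introduced for illustration and are not a format
defined by the invention.

```python
from collections import defaultdict
from dataclasses import dataclass


@dataclass(frozen=True)
class AssayRecord:
    """One hypothetical assay data record."""
    target_id: str       # (1) unique identification code of the target species
    sample_source: str   # (2) source of the target-containing sample
    quantity: float      # (3) absolute or relative quantity in the sample


def cross_tabulate(records):
    """Build a {target_id: {sample_source: quantity}} table from records."""
    table = defaultdict(dict)
    for record in records:
        table[record.target_id][record.sample_source] = record.quantity
    return {target: dict(by_source) for target, by_source in table.items()}


if __name__ == "__main__":
    example = [
        AssayRecord("target-001", "control tissue", 1.0),
        AssayRecord("target-001", "neoplastic lesion", 4.2),
        AssayRecord("target-002", "control tissue", 3.0),
    ]
    for target, by_source in cross_tabulate(example).items():
        print(target, by_source)
```

Keying the table first by target and then by sample source makes it straightforward to
compare the quantity of one target across control and pathological specimens.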

The invention also provides for the storage and retrieval of a collection of target data
in a computer data storage apparatus, which can include magnetic disks, optical disks,
magneto-optical disks, DRAM, SRAM, SGRAM, SDRAM, RDRAM, DDR RAM, magnetic
bubble memory devices, and other data storage devices, including CPU registers and on-CPU
data storage arrays. Typically, the target data records are stored as a bit pattern in an array of
magnetic domains on a magnetizable medium or as an array of charge states or transistor gate
states, such as an array of cells in a DRAM device (e.g., each cell comprised of a transistor
and a charge storage area, which may be on the transistor). In one embodiment, the invention
provides such storage devices, and computer systems built therewith, comprising a bit pattern
encoding a protein expression fingerprint record comprising unique identifiers for at least 10
target data records cross-tabulated with target source.

When the target is a peptide or nucleic acid, the invention preferably provides a
method for identifying related peptide or nucleic acid sequences, comprising performing a
computerized comparison between a peptide or nucleic acid sequence assay record stored in
or retrieved from a computer storage device or database and at least one other sequence. The
comparison can include a sequence analysis or comparison algorithm or computer program
embodiment thereof (e.g., FASTA, TFASTA, GAP, BESTFIT) and/or the comparison may
be of the relative amount of a peptide or nucleic acid sequence in a pool of sequences
determined from a polypeptide or nucleic acid sample of a specimen.

The invention also preferably provides a magnetic disk, such as an IBM-compatible
(DOS, Windows, Windows95/98/2000, Windows NT, OS/2) or other format (e.g., Linux,
SunOS, Solaris, AIX, SCO Unix, VMS, MV, Macintosh, etc.) floppy diskette or hard (fixed,
Winchester) disk drive, comprising a bit pattern encoding data from an assay of the invention
in a file format suitable for retrieval and processing in a computerized sequence analysis,
comparison, or relative quantitation method.

The invention also provides a network, comprising a plurality of computing devices
linked via a data link, such as an Ethernet cable (coax or 10BaseT), telephone line, ISDN
line, wireless network, optical fiber, or other suitable signal transmission medium, whereby at
least one network device (e.g., computer, disk array, etc.) comprises a pattern of magnetic
domains (e.g., magnetic disk) and/or charge domains (e.g., an array of DRAM cells)
composing a bit pattern encoding data acquired from an assay of the invention.

The invention also provides a method for transmitting assay data that includes
generating an electronic signal on an electronic communications device, such as a modem,
ISDN terminal adapter, DSL, cable modem, ATM switch, or the like, wherein the signal
includes (in native or encrypted format) a bit pattern encoding data from an assay or a
database comprising a plurality of assay results obtained by the method of the invention.

In a preferred embodiment, the invention provides a computer system for comparing a
query target to a database containing an array of data structures, such as an assay result
obtained by the method of the invention, and ranking database targets based on the degree of
identity and gap weight to the target data. A central processor is preferably initialized to load
and execute the computer program for alignment and/or comparison of the assay results.
Data for a query target is entered into the central processor via an I/O device. Execution of
the computer program results in the central processor retrieving the assay data from the data
file, which comprises a binary description of an assay result.
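
To illustrate what scoring by "degree of identity and gap weight" can look like, the
following minimal sketch computes a global alignment score by the well-known
Needleman-Wunsch dynamic programming method. It is a stand-in chosen for this example and
is not the algorithm of any program named herein; the function name and the match,
mismatch, and gap values are assumptions.

```python
def global_alignment_score(seq_a, seq_b, match=1, mismatch=-1, gap=-2):
    """Return the best global alignment score of two sequences.

    Identical positions add `match`, differing positions add `mismatch`,
    and each gapped position adds `gap` (a linear gap weight).
    """
    # Row 0: aligning an empty prefix of seq_a against prefixes of seq_b.
    previous = [j * gap for j in range(len(seq_b) + 1)]
    for i in range(1, len(seq_a) + 1):
        current = [i * gap]
        for j in range(1, len(seq_b) + 1):
            pair = match if seq_a[i - 1] == seq_b[j - 1] else mismatch
            current.append(max(previous[j - 1] + pair,   # align two residues
                               previous[j] + gap,        # gap in seq_b
                               current[j - 1] + gap))    # gap in seq_a
        previous = current
    return previous[-1]


if __name__ == "__main__":
    print(global_alignment_score("ACGT", "ACGT"))
    print(global_alignment_score("ACGT", "AGT"))
```

Only two rows of the scoring matrix are kept in memory, which is sufficient when the score
alone, rather than the alignment itself, is needed for ranking.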

The target data or record and the computer program can be transferred to secondary
memory, which is typically random access memory (e.g., DRAM, SRAM, SGRAM, or
SDRAM). Targets are ranked according to the degree of correspondence between a selected
assay characteristic (e.g., binding to a selected affinity moiety) and the same characteristic of
the query target and results are output via an I/O device. For example, a central processor
can be a conventional computer (e.g., Intel Pentium, PowerPC, Alpha, PA-8000, SPARC,
MIPS 4400, MIPS 10000, VAX, etc.); a program can be a commercial or public domain
molecular biology software package (e.g., UWGCG Sequence Analysis Software, Darwin); a
data file can be an optical or magnetic disk, a data server, a memory device (e.g., DRAM,
SRAM, SGRAM, SDRAM, EPROM, bubble memory, flash memory, etc.); an I/O device can
be a terminal comprising a video display and a keyboard, a modem, an ISDN terminal
adapter, an Ethernet port, a punched card reader, a magnetic strip reader, or other suitable I/O
device.

The invention also preferably provides the use of a computer system, such as that
described above, which comprises: (1) a computer; (2) a stored bit pattern encoding a
collection of peptide sequence specificity records obtained by the methods of the invention,
which may be stored in the computer; (3) a comparison target, such as a query target; and (4)
a program for alignment and comparison, typically with rank-ordering of comparison results
on the basis of computed similarity values.
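
The rank-ordering of comparison results described above can be sketched as follows. The
example uses the ratio computed by Python's standard difflib.SequenceMatcher as a generic
similarity value; that choice, the function name, and the peptide strings are assumptions
for illustration and do not represent a program or data of the invention.

```python
from difflib import SequenceMatcher


def rank_targets(query, stored_records):
    """Return (name, similarity) pairs sorted from most to least similar.

    stored_records maps a record name to its sequence string. Similarity is
    SequenceMatcher.ratio(), a value between 0.0 and 1.0.
    """
    scored = [
        (name, SequenceMatcher(None, query, sequence).ratio())
        for name, sequence in stored_records.items()
    ]
    return sorted(scored, key=lambda item: item[1], reverse=True)


if __name__ == "__main__":
    # Hypothetical peptide records used only to exercise the function.
    records = {
        "record_a": "MKTAYIAK",
        "record_b": "MKTAYIAR",
        "record_c": "GGGGGGGG",
    }
    for name, similarity in rank_targets("MKTAYIAK", records):
        print(f"{name}\t{similarity:.3f}")
```

Any other computed similarity value, such as an alignment score, could be substituted for
the ratio without changing the ranking step.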

Characteristics of bladder cancer-associated proteins 

Bladder cancer proteins of the present invention may be classified as secreted
proteins, transmembrane proteins or intracellular proteins. In one embodiment, the bladder
cancer protein is an intracellular protein. Intracellular proteins may be found in the
cytoplasm and/or in the nucleus. Intracellular proteins are involved in all aspects of cellular
function and replication (including, e.g., signaling pathways); aberrant expression of such
proteins often results in unregulated or dysregulated cellular processes (see, e.g., Alberts, et
al. (1994) Molecular Biology of the Cell (3d ed.) Garland). For example, many intracellular
proteins have enzymatic activity such as protein kinase activity, protein phosphatase activity,
protease activity, nucleotide cyclase activity, polymerase activity and the like. Intracellular
proteins also serve as docking proteins that are involved in organizing complexes of proteins,
or targeting proteins to various subcellular localizations, and are involved in maintaining the
structural integrity of organelles.

An increasingly appreciated concept in characterizing proteins is the presence in the
proteins of one or more structural motifs for which defined functions have been attributed. In
addition to the highly conserved sequences found in the enzymatic domain of proteins, highly
conserved sequences have been identified in proteins that are involved in protein-protein
interaction. For example, Src-homology-2 (SH2) domains bind tyrosine-phosphorylated
targets in a sequence dependent manner. PTB domains, which are distinct from SH2
domains, also bind tyrosine phosphorylated targets. SH3 domains bind to proline-rich
targets. In addition, PH domains, tetratricopeptide repeats and WD domains, to name only a
few, have been shown to mediate protein-protein interactions. Some of these may also be
involved in binding to phospholipids or other second messengers. As will be appreciated by
one of ordinary skill in the art, these motifs can be identified on the basis of amino acid
sequence; thus, an analysis of the sequence of proteins may provide insight into both the
enzymatic potential of the molecule and/or molecules with which the protein may associate.
One useful database is Pfam (protein families), which is a large collection of multiple
sequence alignments and hidden Markov models covering many common protein domains.
Versions are available via the internet from Washington University in St. Louis, the Sanger
Center in England, and the Karolinska Institute in Sweden. See, e.g., Bateman, et al. (2000)
Nuc. Acids Res. 28:263-266; Sonnhammer, et al. (1997) Proteins 28:405-420; Bateman, et al.
(1999) Nuc. Acids Res. 27:260-262; and Sonnhammer, et al. (1998) Nuc. Acids Res. 26:320-
322.

In another embodiment, the bladder cancer sequences are transmembrane proteins.
Transmembrane proteins are molecules that span a phospholipid bilayer of a cell. They may
have an intracellular domain, an extracellular domain, or both. The intracellular domains of
such proteins may have a number of functions including those already described for
intracellular proteins. For example, the intracellular domain may have enzymatic activity
and/or may serve as a binding site for additional proteins. Frequently the intracellular
domain of transmembrane proteins serves both roles. For example, certain receptor tyrosine
kinases have both protein kinase activity and SH2 domains. In addition, autophosphorylation
of tyrosines on the receptor molecule itself creates binding sites for additional SH2
domain-containing proteins.

Transmembrane proteins may contain from one to many transmembrane domains.
For example, receptor tyrosine kinases, certain cytokine receptors, receptor guanylyl cyclases
and receptor serine/threonine protein kinases contain a single transmembrane domain.
However, various other proteins including channels and adenylyl cyclases contain numerous
transmembrane domains. Many important cell surface receptors such as G protein-coupled
receptors (GPCRs) are classified as "seven transmembrane domain" proteins, as they contain
seven membrane-spanning regions. Characteristics of transmembrane domains include
approximately 17 consecutive hydrophobic amino acids that may be followed by charged
amino acids. Therefore, upon analysis of the amino acid sequence of a particular protein, the
localization and number of transmembrane domains within the protein may be predicted (see,
e.g., PSORT web site http://psort.nibb.ac.jp/). Important transmembrane protein receptors
include, but are not limited to the insulin receptor, insulin-like growth factor receptor, human
growth hormone receptor, glucose transporters, transferrin receptor, epidermal growth factor
receptor, low density lipoprotein receptor, leptin receptor,
and interleukin receptors, e.g., IL-1 receptor, IL-2 receptor, etc.
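
The sequence-based prediction mentioned above can be illustrated, in greatly simplified
form, by scanning for runs of consecutive hydrophobic residues. This sketch is not the
PSORT method; the function name, the particular set of residues treated as hydrophobic
(A, I, L, M, F, V, W), and the default run length of 17 are assumptions made for this
example.

```python
import re

# Assumed hydrophobic residue set (one-letter codes), for illustration only.
HYDROPHOBIC_RESIDUES = "AILMFVW"


def find_hydrophobic_runs(sequence, min_length=17):
    """Return (start, end) index pairs of runs of hydrophobic residues.

    A run qualifies when it contains at least min_length consecutive residues
    from HYDROPHOBIC_RESIDUES; `end` is exclusive, as in Python slicing.
    """
    pattern = re.compile(f"[{HYDROPHOBIC_RESIDUES}]{{{min_length},}}")
    return [(m.start(), m.end()) for m in pattern.finditer(sequence.upper())]


if __name__ == "__main__":
    # Hypothetical sequence: a 20-residue hydrophobic stretch flanked by
    # charged residues.
    candidate = "MKR" + "LIVAL" * 4 + "KRDE"
    print(find_hydrophobic_runs(candidate))
```

Practical predictors typically use hydropathy scales averaged over a sliding window rather
than an exact run of residues, so this sketch will miss helices interrupted by a single
polar residue.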

The extracellular domains of transmembrane proteins are diverse; however, conserved
motifs are found repeatedly among various extracellular domains. Conserved structure
and/or functions have been ascribed to different extracellular motifs. Many extracellular
domains are involved in binding to other molecules. In one aspect, extracellular domains are
found on receptors. Factors that bind the receptor domain include circulating ligands, which
may be peptides, proteins, or small molecules such as adenosine and the like. For example,
growth factors such as EGF, FGF, and PDGF are circulating growth factors that bind to their
cognate receptors to initiate a variety of cellular responses. Other factors include cytokines,
mitogenic factors, neurotrophic factors and the like. Extracellular domains also bind to
cell-associated molecules. In this respect, they mediate cell-cell interactions. Cell-associated
ligands can be tethered to the cell, e.g., via a glycosylphosphatidylinositol (GPI) anchor, or
may themselves be transmembrane proteins. Extracellular domains also associate with the
extracellular matrix and contribute to the maintenance of the cell structure.

Bladder cancer proteins that are transmembrane are particularly preferred in the
present invention as they are readily accessible targets for immunotherapeutics, as are
described herein. In addition, as outlined below, transmembrane proteins can be also useful
in imaging modalities. Antibodies may be used to label such readily accessible proteins in
situ. Alternatively, antibodies can also label intracellular proteins, in which case samples are
typically permeabilized to provide access to intracellular proteins.

It will also be appreciated by those in the art that a transmembrane protein can be 
made soluble by removing transmembrane sequences, e.g., through recombinant methods. 
Furthermore, transmembrane proteins that have been made soluble can be made to be
secreted through recombinant means by adding an appropriate signal sequence. 

In another embodiment, the bladder cancer proteins are secreted proteins, the
secretion of which can be either constitutive or regulated. These proteins may have a signal
peptide or signal sequence that targets the molecule to the secretory pathway. Secreted
proteins are involved in numerous physiological events; e.g., if circulating, they often serve
to transmit signals to various other cell types. The secreted protein may function in an
autocrine manner (acting on the cell that secreted the factor), a paracrine manner (acting on
cells in close proximity to the cell that secreted the factor), an endocrine manner (acting on
cells at a distance, e.g., secretion into the blood stream), or an exocrine manner (secretion, e.g.,
through a duct or to an adjacent epithelial surface, as with sweat glands, sebaceous glands,
pancreatic ducts, lacrimal glands, mammary glands, wax producing glands of the ear, etc.).
Thus secreted molecules often find use in modulating or altering numerous aspects of
physiology. Bladder cancer proteins that are secreted or released proteins are particularly
preferred in the present invention as they serve as good targets for diagnostic markers, e.g.,
for blood, plasma, serum, or urine tests. Those which are enzymes may be antibody or small
molecule targets. Others may be useful as vaccine targets, e.g., via CTL mechanisms.

Use of bladder cancer nucleic acids 

As described above, a bladder cancer sequence is initially identified by substantial
nucleic acid and/or amino acid sequence homology or linkage to the bladder cancer
sequences outlined herein. Such homology can be based upon the overall nucleic acid or
amino acid sequence, and is generally determined as outlined below, using either homology
programs or hybridization conditions. Typically, linked sequences on an mRNA are found on
the same molecule.

The bladder cancer nucleic acid sequences of the invention, e.g., the sequences in
Tables 1A-13, can be fragments of larger genes, e.g., they are nucleic acid segments.
"Genes" in this context includes coding regions, non-coding regions, and mixtures of coding
and non-coding regions. Accordingly, as will be appreciated by those in the art, using the
sequences provided herein, extended sequences, in either direction, of the bladder cancer
genes can be obtained, using techniques well known in the art for cloning either longer
sequences or the full length sequences; see Ausubel, et al., supra. Much can be done by
informatics and many sequences can be clustered to include multiple sequences
corresponding to a single gene, e.g., systems such as UniGene (see,
http://www.ncbi.nlm.nih.gov/UniGene/).

Once a bladder cancer nucleic acid is identified, it can be cloned and, if necessary, its
constituent parts recombined to form the entire bladder cancer nucleic acid coding regions or
the entire mRNA sequence. Once isolated from its natural source, e.g., contained within a
plasmid or other vector or excised therefrom as a linear nucleic acid segment, the
recombinant bladder cancer nucleic acid can be further used as a probe to identify and isolate
other bladder cancer nucleic acids, e.g., extended coding regions. It can also be used as a
"precursor" nucleic acid to make modified or variant bladder cancer nucleic acids and

The bladder cancer nucleic acids of the present invention are used in several ways. In
a first embodiment, nucleic acid probes to the bladder cancer nucleic acids are made and
attached to biochips to be used in screening and diagnostic methods, as outlined below, or for
administration, e.g., for gene therapy, vaccine, and/or antisense/inhibition applications.
Alternatively, the bladder cancer nucleic acids that include coding regions of bladder cancer
proteins can be put into expression vectors for the expression of bladder cancer proteins,
again for screening purposes or for administration to a patient.

In a preferred embodiment, nucleic acid probes to bladder cancer nucleic acids (both
the nucleic acid sequences outlined in the tables and/or the complements thereof) are made.
The nucleic acid probes attached to the biochip are designed to be substantially
complementary to the bladder cancer nucleic acids, e.g., the target sequence (either the target
sequence of the sample or to other probe sequences, e.g., in sandwich assays), such that
hybridization of the target sequence and the probes of the present invention occurs. As
outlined below, this complementarity need not be perfect; there may be a number of base pair
mismatches which will interfere with hybridization between the target sequence and the
single stranded nucleic acids of the present invention. However, if the number of mutations
is so great that no hybridization can occur under even the least stringent of hybridization
conditions, the sequence is not a complementary target sequence. Thus, by "substantially
complementary" herein is meant that the probes are sufficiently complementary to the target
sequences to hybridize under normal reaction conditions, particularly high stringency
conditions, as outlined herein.

A nucleic acid probe is generally single stranded but can be partially single and
partially double stranded. The strandedness of the probe is dictated by the structure,
composition, and properties of the target sequence. In general, the nucleic acid probes range
from about 8 to about 100 bases long, with from about 10 to about 80 bases being preferred,
and from about 30 to about 50 bases being particularly preferred. That is, generally whole
genes are not used. In some embodiments, much longer nucleic acids can be used, up to
hundreds of bases.

In a preferred embodiment, more than one probe per sequence is used, with either
overlapping probes or probes to different sections of the target being used. That is, two,
three, four or more probes, with three being preferred, are used to build in a redundancy for a
particular target. The probes can be overlapping (e.g., have some sequence in common), or
separate. In some cases, PCR primers may be used to amplify signal for higher sensitivity.
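
The probe design described above (probes complementary to the target, within the stated length range, with several probes per target for redundancy) can be illustrated with a minimal sketch. The function names, the default probe length of 40 bases, and the even spacing of probes along the target are assumptions made for illustration only; they are not taken from the specification, and real probe selection would also weigh melting temperature, secondary structure, and cross-hybridization.

```python
# Illustrative sketch only: names, defaults and the even-spacing rule are assumptions.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence (A/C/G/T only)."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def tile_probes(target: str, probe_len: int = 40, n_probes: int = 3) -> list[str]:
    """Choose up to n_probes evenly spaced windows on the target and
    return the probes that are perfectly complementary to those windows."""
    target = target.upper()
    if not 8 <= probe_len <= 100:
        raise ValueError("probe length outside the 8-100 base range given in the text")
    if n_probes < 1:
        raise ValueError("at least one probe is required")
    if len(target) < probe_len:
        raise ValueError("target is shorter than the requested probe length")
    last_start = len(target) - probe_len
    if n_probes == 1:
        starts = [0]
    else:
        starts = [round(i * last_start / (n_probes - 1)) for i in range(n_probes)]
    # Drop duplicate start positions (short targets) while keeping order.
    starts = list(dict.fromkeys(starts))
    return [reverse_complement(target[s:s + probe_len]) for s in starts]
```

With a target only slightly longer than the probe length, the windows overlap; with a long target they are separate, which corresponds to the two options named in the text.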

As will be appreciated by those in the art, nucleic acids can be attached or
immobilized to a solid support in a wide variety of ways. By "immobilized" and grammatical
equivalents herein is meant that the association or binding between the nucleic acid probe and
the solid support is sufficient to be stable under the conditions of binding, washing, analysis,
and removal as outlined below. The binding can typically be covalent or non-covalent. By
"non-covalent binding" and grammatical equivalents herein is meant one or more of
electrostatic, hydrophilic, and hydrophobic interactions. Included in non-covalent binding is
the covalent attachment of a molecule, such as streptavidin, to the support and the
non-covalent binding of the biotinylated probe to the streptavidin. By "covalent binding" and
grammatical equivalents herein is meant that the two moieties, the solid support and the
probe, are attached by at least one bond, including sigma bonds, pi bonds and coordination
bonds. Covalent bonds can be formed directly between the probe and the solid support or can
be formed by a cross linker or by inclusion of a specific reactive group on either the solid
support or the probe or both molecules. Immobilization may also involve a combination of
covalent and non-covalent interactions.

In general, the probes are attached to the biochip in a wide variety of ways, as will be
appreciated by those in the art. As described herein, the nucleic acids can either be
synthesized first, with subsequent attachment to the biochip, or can be directly synthesized on
the biochip.

The biochip comprises a suitable solid substrate. By "substrate" or "solid support" or
other grammatical equivalents herein is meant a material that can be modified to contain
discrete individual sites appropriate for the attachment or association of the nucleic acid
probes and is amenable to at least one detection method. As will be appreciated by those in
the art, the number of possible substrates is very large, and includes, but is not limited to,
glass and modified or functionalized glass, plastics (including acrylics, polystyrene and
copolymers of styrene and other materials, polypropylene, polyethylene, polybutylene,
polyurethanes, Teflon™, etc.), polysaccharides, nylon or nitrocellulose, resins, silica or
silica-based materials including silicon and modified silicon, carbon, metals, inorganic glasses,
plastics, etc. In general, the substrates allow optical detection and do not appreciably
fluoresce. See WO 00/55627.

Generally the substrate is planar, although as will be appreciated by those in the art,
other configurations of substrates may be used as well. For example, the probes may be
placed on the inside surface of a tube, for flow-through sample analysis to minimize sample
volume. Similarly, the substrate may be flexible, such as a flexible foam, including closed
cell foams made of particular plastics.

In a preferred embodiment, the surface of the biochip and the probe may be
derivatized with chemical functional groups for subsequent attachment of the two. Thus, e.g.,
the biochip is derivatized with a chemical functional group including, but not limited to,
amino groups, carboxy groups, oxo groups and thiol groups, with amino groups being
particularly preferred. Using these functional groups, the probes can be attached using
functional groups on the probes. For example, nucleic acids containing amino groups can be
attached to surfaces comprising amino groups, e.g., using linkers as are known in the art; e.g.,
homo- or hetero-bifunctional linkers as are well known (see 1994 Pierce Chemical Company
catalog, technical section on cross-linkers, pages 155-200). In addition, in some cases,
additional linkers, such as alkyl groups (including substituted and heteroalkyl groups) may be
used.

In this embodiment, oligonucleotides are synthesized as is known in the art, and then
attached to the surface of the solid support. As will be appreciated by those skilled in the art,
either the 5' or 3' terminus may be attached to the solid support, or attachment may be via an
internal nucleoside.

In another embodiment, the immobilization to the solid support may be very strong, 
yet non-covalent. For example, biotinylated oligonucleotides can be made, which bind to 
surfaces covalently coated with streptavidin, resulting in attachment. 

Alternatively, the oligonucleotides may be synthesized on the surface, as is known in
the art. For example, photoactivation techniques utilizing photopolymerization compounds
and techniques are used. In a preferred embodiment, the nucleic acids can be synthesized in
situ, using well known photolithographic techniques, such as those described in WO
95/25116; WO 95/35505; U.S. Patent Nos. 5,700,637 and 5,445,934; and references cited
within, all of which are expressly incorporated by reference; these methods of attachment
form the basis of the Affymetrix GeneChip™ technology.

Often, amplification-based assays are performed to measure the expression level of
bladder cancer-associated sequences. These assays are typically performed in conjunction
with reverse transcription. In such assays, a bladder cancer-associated nucleic acid sequence
acts as a template in an amplification reaction (e.g., Polymerase Chain Reaction, or PCR). In
a quantitative amplification, the amount of amplification product will be proportional to the
amount of template in the original sample. Comparison to appropriate controls provides a
measure of the amount of bladder cancer-associated RNA. Methods of quantitative
amplification are well known to those of skill in the art. Detailed protocols for quantitative
PCR are provided, e.g., in Innis, et al. (1990) PCR Protocols: A Guide to Methods and
Applications Academic Press.
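
The comparison to controls described above can be illustrated with a short sketch. The specification does not name a particular calculation; the sketch below assumes the widely used comparative threshold-cycle (2^-ΔΔCt) approach, a reference transcript for normalization, and an amplification efficiency of 2 per cycle. The function name, the parameter names, and the example Ct values are hypothetical.

```python
# Illustrative sketch only: the comparative Ct calculation is an assumed
# method, not one stated in the specification.
def relative_expression(ct_target_sample: float, ct_ref_sample: float,
                        ct_target_control: float, ct_ref_control: float,
                        efficiency: float = 2.0) -> float:
    """Fold change of a target transcript in a sample relative to a control,
    each normalized to a reference transcript measured in the same material."""
    delta_sample = ct_target_sample - ct_ref_sample      # normalize the sample
    delta_control = ct_target_control - ct_ref_control   # normalize the control
    delta_delta = delta_sample - delta_control
    # A lower threshold cycle means more starting template, hence the negative exponent.
    return efficiency ** -delta_delta


# Hypothetical values: the target crosses threshold 3 cycles earlier in the
# sample than in the control after normalization, i.e. about 8-fold more template.
fold = relative_expression(22.0, 18.0, 25.0, 18.0)
```

The result is only as reliable as the assumed efficiency; where the efficiency has been measured from a standard curve, it can be passed in place of the default.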

In some embodiments, a TaqMan based assay is used to measure expression.
TaqMan based assays use a fluorogenic oligonucleotide probe that contains a 5' fluorescent
dye and a 3' quenching agent. The probe hybridizes to a PCR product, but cannot itself be
extended due to a blocking agent at the 3' end. When the PCR product is amplified in
subsequent cycles, the 5' nuclease activity of the polymerase, e.g., AmpliTaq, results in the
cleavage of the TaqMan probe. This cleavage separates the 5' fluorescent dye and the 3'
quenching agent, thereby resulting in an increase in fluorescence as a function of
amplification. See, e.g., literature provided by Perkin-Elmer, e.g., www2.perkin-elmer.com.

Other suitable amplification methods include, but are not limited to, ligase chain
reaction (LCR) (see Wu and Wallace (1989) Genomics 4:560-569; Landegren, et al. (1988)
Science 241:1077-1080; and Barringer, et al. (1990) Gene 89:117-122), transcription
amplification (Kwoh, et al. (1989) Proc. Nat'l Acad. Sci. USA 86:1173-1177), self-sustained
sequence replication (Guatelli, et al. (1990) Proc. Nat'l Acad. Sci. USA 87:1874-1878), dot
PCR, and linker adapter PCR, etc.

Expression of bladder cancer proteins from nucleic acids 

In a preferred embodiment, bladder cancer nucleic acids, e.g., encoding bladder
cancer proteins, are used to make a variety of expression vectors to express bladder cancer
proteins which can then be used in screening assays, as described below. Expression vectors
and recombinant DNA technology are well known to those of skill in the art (see, e.g.,
Ausubel, supra, and Fernandez and Hoeffler (eds. 1999) Gene Expression Systems Academic
Press) and are used to express proteins. The expression vectors may be either self-replicating
extrachromosomal vectors or vectors which integrate into a host genome. Generally, these
expression vectors include transcriptional and translational regulatory nucleic acid operably
linked to the nucleic acid encoding the bladder cancer protein. The term "control sequences"
refers to DNA sequences used for the expression of an operably linked coding sequence in a
particular host organism. Control sequences that are suitable for prokaryotes, e.g., include a
promoter, optionally an operator sequence, and a ribosome binding site. Eukaryotic cells are
known to utilize promoters, polyadenylation signals, and enhancers.

Nucleic acid is "operably linked" when it is placed into a functional relationship with
another nucleic acid sequence. For example, DNA for a presequence or secretory leader is
operably linked to DNA for a polypeptide if it is expressed as a preprotein that participates in
the secretion of the polypeptide; a promoter or enhancer is operably linked to a coding
sequence if it affects the transcription of the sequence; a ribosome binding site is operably
linked to a coding sequence if it is positioned so as to facilitate translation; two sequences
may be operably linked if they are physically linked on a single polynucleotide. Generally,
"operably linked" means that the DNA sequences being linked are contiguous, and, in the
case of a secretory leader, contiguous and in reading phase. However, enhancers do not have
to be contiguous. Linking is typically accomplished by ligation at convenient restriction
sites. If such sites do not exist, synthetic oligonucleotide adaptors or linkers are used in
accordance with conventional practice. Transcriptional and translational regulatory nucleic
acid will generally be appropriate to the host cell used to express the bladder cancer protein.
Numerous types of appropriate expression vectors, and suitable regulatory sequences are
known in the art for a variety of host cells.

In general, transcriptional and translational regulatory sequences may include, but are
not limited to, promoter sequences, ribosomal binding sites, transcriptional start and stop
sequences, translational start and stop sequences, and enhancer or activator sequences. In a
preferred embodiment, the regulatory sequences include a promoter and transcriptional start
and stop sequences.

Promoter sequences encode either constitutive or inducible promoters. The promoters
may be either naturally occurring promoters or hybrid promoters. Hybrid promoters, which
combine elements of more than one promoter, are useful in the present invention.

In addition, an expression vector may comprise additional elements. The expression
vector may have two replication systems, thus allowing it to be maintained in two organisms,
e.g., in mammalian or insect cells for expression and in a prokaryotic host for cloning and
replication. For integrating expression vectors, the expression vector may contain at least one
sequence homologous to the host cell genome, and preferably two homologous sequences
which flank the expression construct. The integrating vector may be directed to a specific
locus in the host cell by selecting the appropriate homologous sequence for inclusion in the
vector. Constructs for integrating vectors are well known (e.g., Fernandez and Hoeffler,
supra).

In addition, in a preferred embodiment, the expression vector contains a selectable
marker gene to allow the selection of transformed host cells. Selection genes are well known
in the art and will vary with the host cell used.

The bladder cancer proteins of the present invention may be produced by culturing a
host cell transformed with an expression vector under the appropriate conditions to induce or
cause expression of the bladder cancer protein. Conditions appropriate for bladder cancer
protein expression will vary with the choice of the expression vector and the host cell, and
will be easily ascertained by one skilled in the art through routine experimentation or
optimization. For example, the use of constitutive promoters in the expression vector will
typically require optimizing the growth and proliferation of the host cell, while the use of an
inducible promoter typically requires identifying the appropriate growth conditions for
induction. In addition, in some embodiments, the timing of the harvest is important. For
example, the baculoviral systems used in insect cell expression are lytic viruses, and harvest
time selection can be crucial for product yield.

Appropriate host cells include yeast, bacteria, archaebacteria, fungi, insect, and
animal cells, including mammalian cells. Of particular interest are Saccharomyces cerevisiae
and other yeasts, E. coli, Bacillus subtilis, Sf9 cells, C129 cells, 293 cells, Neurospora, BHK,
CHO, COS, HeLa cells, HUVEC (human umbilical vein endothelial cells), THP1 cells (a
macrophage cell line) and various other human cells and cell lines.

In a preferred embodiment, the bladder cancer proteins are expressed in mammalian
cells. Mammalian expression systems include retroviral and adenoviral systems. Retroviral
vector systems are described in PCT/US97/01019 and PCT/US97/01048. Of particular use
are promoters from mammalian viral genes, since viral genes are often highly expressed and
have a broad host range. Examples include the SV40 early promoter, mouse mammary tumor
virus LTR promoter, adenovirus major late promoter, herpes simplex virus promoter, and the
CMV promoter (see, e.g., Fernandez and Hoeffler, supra). Typically, transcription
termination and polyadenylation sequences recognized by mammalian cells are regulatory
regions located 3' to the translation stop codon and thus, together with the promoter elements,
flank the coding sequence. Examples of transcription terminator and polyadenylation signals
include those derived from SV40.

Methods of introducing exogenous nucleic acid into mammalian and other hosts are
well known, and will vary with the host cell used. Techniques include dextran-mediated
transfection, calcium phosphate precipitation, polybrene mediated transfection, protoplast
fusion, electroporation, viral infection, encapsulation of the polynucleotide(s) in liposomes,
and direct microinjection of the DNA into nuclei.

In another embodiment, bladder cancer proteins are expressed in bacterial systems.
Promoters from bacteriophage may also be used. Synthetic promoters and hybrid promoters
are also useful; e.g., the tac promoter is a hybrid of the trp and lac promoter sequences. A
bacterial promoter can include naturally occurring promoters of non-bacterial origin that have
the ability to bind bacterial RNA polymerase and initiate transcription. Often an efficient
ribosome binding site is desirable. The expression vector may include a signal peptide
sequence that provides for secretion of the bladder cancer protein. The protein is either
secreted into the growth media (gram-positive bacteria) or into the periplasmic space, located
between the inner and outer membrane of the cell (gram-negative bacteria). The bacterial
expression vector may include a selectable marker gene to allow for the selection of bacterial
strains that have been transformed. Suitable selection genes include genes which render the
bacteria resistant to drugs, e.g., ampicillin, chloramphenicol, erythromycin, kanamycin,
neomycin, and tetracycline, or biosynthetic genes, e.g., those in the histidine, tryptophan, and
leucine biosynthetic pathways. These components are assembled into expression vectors.
Expression vectors for bacteria include vectors for Bacillus subtilis, E. coli, Streptococcus
cremoris, and Streptococcus lividans, among others (e.g., Fernandez and Hoeffler, supra).
The bacterial expression vectors are transformed into bacterial host cells using, e.g., calcium
chloride treatment, electroporation, and other methods.

Bladder cancer proteins can also be produced in insect cells. See, e.g., Miller, et al.
(1997) Baculovirus Expression Vectors: A Laboratory Manual Oxford Books; ISBN:
0716770172; and Makrides (1999) Prot. Expr. Purif. 17:183-202.

Bladder cancer protein may be produced in yeast cells. Yeast expression systems
exist with expression vectors for Saccharomyces cerevisiae, Candida albicans and C. maltosa,
Hansenula polymorpha, Kluyveromyces fragilis and K. lactis, Pichia guilliermondii and P.
pastoris, Schizosaccharomyces pombe, and Yarrowia lipolytica. See, e.g., Jones, et al. (eds.
1993) The Molecular and Cellular Biology of the Yeast Saccharomyces: Gene Expression
CSH Press; ISBN: 0879693657.

The bladder cancer protein may also be made as a fusion protein, using techniques
well known in the art. Thus, e.g., for the creation of monoclonal antibodies, if the desired
epitope is small, the bladder cancer protein may be fused to a carrier protein to form an
immunogen. Alternatively, the bladder cancer protein may be made as a fusion protein to
increase expression, or for other reasons. For example, when the bladder cancer protein is a
bladder cancer peptide, the nucleic acid encoding the peptide may be linked to other nucleic
acid for expression or purification purposes.

The bladder cancer protein is typically purified or isolated after expression. Bladder
cancer proteins may be isolated or purified in a variety of ways, depending on what other
components are present in the sample. Standard purification methods include electrophoretic,
molecular, immunological, and chromatographic techniques, including ion exchange,
hydrophobic, affinity, reverse-phase HPLC chromatography, and chromatofocusing. The
bladder cancer protein may be purified using a standard anti-bladder cancer protein antibody
affinity column. Ultrafiltration and diafiltration techniques, in conjunction with protein
concentration, are also useful. For general guidance in suitable purification techniques, see,
e.g., Scopes (1982) Protein Purification Springer-Verlag. The degree of purification
necessary will vary depending on the use of the bladder cancer protein. In some instances no
purification will be necessary, which may depend on the intended use.

Once expressed and purified, if necessary, the bladder cancer proteins and nucleic
acids are useful in a number of applications. They may be used as immunoselection reagents,
as vaccine reagents, as screening agents, etc.

Variants of bladder cancer proteins

In one embodiment, the bladder cancer proteins are derivative or variant bladder
cancer proteins as compared to the wild-type sequence. That is, as outlined more fully below,
the derivative bladder cancer peptide will often contain at least one amino acid substitution,
deletion, or insertion, with amino acid substitutions being particularly preferred. The amino
acid substitution, insertion, or deletion may occur at most residues within the bladder cancer
peptide.

Certain embodiments of bladder cancer proteins of the present invention are amino
acid sequence variants. These variants typically fall into one or more of three classes:
substitutional, insertional, or deletional variants. These variants ordinarily are prepared by
site specific mutagenesis of nucleotides in the DNA encoding the bladder cancer protein,
using cassette or PCR mutagenesis, or other techniques well known in the art, to produce
DNA encoding the variant, and thereafter expressing the DNA in recombinant cell culture as
outlined above. However, variant bladder cancer protein fragments having up to about
100-150 residues may be prepared by in vitro synthesis using established techniques. Amino acid
sequence variants are often characterized by the predetermined nature of the variation, a
feature that sets them apart from naturally occurring allelic or interspecies variation of the
bladder cancer protein amino acid sequence. The variants typically exhibit the same
qualitative biological activity as the naturally occurring analogue, although variants can also
be selected which have modified characteristics as will be more fully outlined below.

While the site or region for introducing an amino acid sequence variation is often
predetermined, the mutation per se need not be predetermined. To optimize the performance
of a mutation at a given site, random mutagenesis may be conducted at the target codon or
region and the expressed bladder cancer variants screened for the optimal combination of
desired activities. Techniques for making substitution mutations at predetermined sites in
DNA having a known sequence are well known, e.g., M13 primer mutagenesis and PCR
mutagenesis. Screening of the mutants is performed using assays of bladder cancer protein
activities.

Amino acid substitutions are typically of single residues; insertions usually will be on
the order of from about 1 to 20 amino acids, although considerably larger insertions may be
tolerated. Deletions range from about 1 to 20 residues, although in some cases deletions may be
much larger.

Substitutions, deletions, insertions, or combinations thereof may be used to arrive at a
final derivative. Generally these changes are done on a few amino acids to minimize the
alteration of the molecule. However, larger changes may be tolerated in certain
circumstances. When small alterations in the characteristics of the bladder cancer protein are
desired, substitutions are generally made in accordance with the amino acid substitution
relationships provided in the definition section.

The variants typically exhibit the same qualitative biological activity and elicit the
same immune response as the naturally-occurring analog, although variants also are selected
to modify the characteristics of the bladder cancer proteins as needed. Alternatively, the
variant may be designed such that the biological activity of the bladder cancer protein is
altered. For example, glycosylation sites may be altered or removed.

Substantial changes in function or immunological identity are made by selecting
substitutions that are less conservative than those described above. Substitutions may be
made which more significantly affect: the structure of the polypeptide backbone in the area of
the alteration, e.g., the alpha-helical or beta-sheet structure; the charge or hydrophobicity of
the molecule at the target site; or the bulk of the side chain. Substitutions which are expected
to produce the greatest changes in the polypeptide's properties are those in which (a) a
hydrophilic residue, e.g., serine or threonine is substituted for (or by) a hydrophobic residue,
e.g., leucine, isoleucine, phenylalanine, valine, or alanine; (b) a cysteine or proline is
substituted for (or by) another residue; (c) a residue having an electropositive side chain, e.g.,
lysine, arginine, or histidine, is substituted for (or by) an electronegative residue, e.g.,
glutamic acid or aspartic acid; or (d) a residue having a bulky side chain, e.g., phenylalanine,
is substituted for (or by) one not having a side chain, e.g., glycine.
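
The four classes (a) through (d) above can be written as a simple lookup, shown here as a minimal sketch. The residue groups contain only the example amino acids named in the text (in one-letter code); the group names and the function name are assumptions for illustration, and the text's lists are introduced with "e.g.," so they are not exhaustive.

```python
# Illustrative sketch only: residue groups are limited to the examples named
# in the text, and all identifiers are hypothetical.
HYDROPHILIC = set("ST")        # serine, threonine
HYDROPHOBIC = set("LIFVA")     # leucine, isoleucine, phenylalanine, valine, alanine
ELECTROPOSITIVE = set("KRH")   # lysine, arginine, histidine
ELECTRONEGATIVE = set("ED")    # glutamic acid, aspartic acid
BULKY = set("F")               # phenylalanine
NO_SIDE_CHAIN = set("G")       # glycine


def _between(a: str, b: str, group1: set, group2: set) -> bool:
    """True if one residue is in group1 and the other in group2 ("for (or by)")."""
    return (a in group1 and b in group2) or (a in group2 and b in group1)


def drastic_substitution_rules(original: str, replacement: str) -> list[str]:
    """Return the labels (a)-(d) of the rules that a single-residue
    substitution matches; an empty list means none of them applies."""
    a, b = original.upper(), replacement.upper()
    if a == b:
        return []
    rules = []
    if _between(a, b, HYDROPHILIC, HYDROPHOBIC):
        rules.append("a")
    if "C" in (a, b) or "P" in (a, b):
        rules.append("b")
    if _between(a, b, ELECTROPOSITIVE, ELECTRONEGATIVE):
        rules.append("c")
    if _between(a, b, BULKY, NO_SIDE_CHAIN):
        rules.append("d")
    return rules
```

For example, a lysine to glutamic acid change matches rule (c), while a leucine to isoleucine change matches none of the rules and would be treated as conservative under this sketch.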

Covalent modifications of bladder cancer polypeptides are included within the scope
of this invention. One type of covalent modification includes reacting targeted amino acid
residues of a bladder cancer polypeptide with an organic derivatizing agent that is capable of
reacting with selected side chains or the N- or C-terminal residues of a bladder cancer
polypeptide. Derivatization with bifunctional agents is useful, e.g., for crosslinking bladder
cancer polypeptides to a water-insoluble support matrix or surface for use in the method for
purifying anti-bladder cancer polypeptide antibodies or screening assays. Commonly used
crosslinking agents include, e.g., 1,1-bis(diazoacetyl)-2-phenylethane, glutaraldehyde,
N-hydroxysuccinimide esters, e.g., esters with 4-azidosalicylic acid, homobifunctional
imidoesters, including disuccinimidyl esters such as 3,3'-dithiobis(succinimidylpropionate),
bifunctional maleimides such as bis-N-maleimido-1,8-octane and agents such as
methyl-3-((p-azidophenyl)dithio)propioimidate.

Other modifications include deamidation of glutaminyl and asparaginyl residues to
the corresponding glutamic and aspartic residues, respectively, hydroxylation of proline and
lysine, phosphorylation of hydroxyl groups of serine, threonine or tyrosine residues,
methylation of the amino groups of the lysine, arginine, and histidine side chains (pp. 79-86,
Creighton (1984) Proteins: Structure and Molecular Properties Freeman), acetylation of the
N-terminal amine, and amidation of a C-terminal carboxyl group.

Another type of covalent modification of the bladder cancer polypeptide included
within the scope of this invention comprises altering the native glycosylation pattern of the
polypeptide. "Altering the native glycosylation pattern" is intended for purposes herein to
mean deleting one or more carbohydrate moieties found in native sequence bladder cancer
polypeptide, and/or adding one or more glycosylation sites that are not present in the native
sequence bladder cancer polypeptide. Glycosylation patterns can be altered in many ways.
For example, the use of different cell types to express bladder cancer-associated sequences
can result in different glycosylation patterns.

Addition of glycosylation sites to bladder cancer polypeptides may also be
accomplished by altering the amino acid sequence thereof. The alteration may be made, e.g.,
by the addition of, or substitution by, one or more serine or threonine residues to the native
sequence bladder cancer polypeptide (for O-linked glycosylation sites). The bladder cancer
amino acid sequence may optionally be altered through changes at the DNA level,
particularly by mutating the DNA encoding the bladder cancer polypeptide at preselected
bases such that codons are generated that will translate into the desired amino acids.
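
Changing the DNA at preselected bases so that a codon translates into the desired amino acid can be sketched as follows. This is an illustration under stated assumptions: it uses the standard genetic code, a 1-based residue position, and picks the replacement codon that differs from the original at the fewest bases. The function names and the short example sequence are hypothetical, and codon usage of the host cell is ignored.

```python
# Illustrative sketch only: standard genetic code assumed; identifiers and
# the example sequence are hypothetical.
from itertools import product

BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
# Map each of the 64 codons (TTT, TTC, ..., GGG) to its one-letter amino acid.
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(coding_dna: str) -> str:
    """Translate a coding sequence, ignoring any trailing partial codon."""
    dna = coding_dna.upper()
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def substitute_residue(coding_dna: str, position: int, new_aa: str) -> str:
    """Return the coding sequence with the codon at the given 1-based residue
    position replaced by the codon for new_aa requiring the fewest base changes."""
    dna = coding_dna.upper()
    start = 3 * (position - 1)
    old_codon = dna[start:start + 3]
    if position < 1 or len(old_codon) != 3:
        raise ValueError("position does not fall on a complete codon")
    candidates = [c for c, aa in CODON_TABLE.items() if aa == new_aa.upper()]
    if not candidates:
        raise ValueError("unknown amino acid code")
    best = min(candidates, key=lambda c: sum(x != y for x, y in zip(c, old_codon)))
    return dna[:start] + best + dna[start + 3:]


# Hypothetical example: introduce a serine at residue 2 of Met-Ala-Lys.
mutated = substitute_residue("ATGGCTAAA", 2, "S")
```

In this example a single base change (GCT to TCT) converts the alanine codon to a serine codon, giving a candidate O-linked site of the kind described above.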

Another means of increasing the number of carbohydrate moieties on the bladder
cancer polypeptide is by chemical or enzymatic coupling of glycosides to the polypeptide.
Such methods are described in the art, e.g., in WO 87/05330, and pp. 259-306 in Aplin and
Wriston (1981) CRC Crit. Rev. Biochem.

Removal of carbohydrate moieties present on the bladder cancer polypeptide may be
accomplished chemically or enzymatically or by mutational substitution of codons encoding
amino acid residues that serve as targets for glycosylation. Chemical deglycosylation
techniques are known in the art. See, e.g., Hakimuddin, et al. (1987) Arch. Biochem.
Biophys. 259:52-57; and Edge, et al. (1981) Anal. Biochem. 118:131-137. Enzymatic
cleavage of carbohydrate moieties on polypeptides can be achieved by the use of a variety of
endo- and exo-glycosidases. See, e.g., Thotakura, et al. (1987) Meth. Enzymol. 138:350-359.

Another type of covalent modification of bladder cancer polypeptides comprises linking the
bladder cancer polypeptide to one of a variety of nonproteinaceous polymers, e.g.,
polyethylene glycol, polypropylene glycol, or polyoxyalkylenes, in the manner set forth in
U.S. Patent Nos. 4,640,835; 4,496,689; 4,301,144; 4,670,417; 4,791,192; or 4,179,337.

Bladder cancer polypeptides of the present invention may also be modified to form
chimeric molecules comprising a bladder cancer polypeptide fused to a heterologous
polypeptide or amino acid sequence. In one embodiment, a chimeric molecule comprises a
fusion of a bladder cancer polypeptide with an epitope tag. The epitope tag is generally
placed at the amino- or carboxyl-terminus of the bladder cancer polypeptide. The presence of
such epitope-tagged forms of a bladder cancer polypeptide can be detected using an antibody
against the tag polypeptide. Also, provision of the epitope tag enables the bladder cancer
polypeptide to be readily purified by affinity purification using an anti-tag antibody or
another type of affinity matrix that binds to the epitope tag. In an alternative embodiment,
the chimeric molecule may comprise a fusion of a bladder cancer polypeptide with an
immunoglobulin or a particular region of an immunoglobulin. For a bivalent form of the
chimeric molecule, such a fusion could be to the Fc region of an IgG molecule.

Various tag polypeptides and their respective antibodies are well known in the art. 
Examples include poly-histidine (poly-his) or poly-histidine-glycine (poly-his-gly) tags; HIS6
and metal chelation tags; the flu HA tag polypeptide and its antibody 12CA5 (Field, et al.
(1988) Mol. Cell. Biol. 8:2159-2165); the c-myc tag and the 8F9, 3C7, 6E10, G4, B7, and
9E10 antibodies thereto (Evan, et al. (1985) Molecular and Cellular Biology 5:3610-3616);
and the Herpes Simplex virus glycoprotein D (gD) tag and its antibody (Paborsky, et al.
(1990) Protein Engineering 3:547-553). Other tag polypeptides include the Flag-peptide
(Hopp, et al. (1988) BioTechnology 6:1204-1210); the KT3 epitope peptide (Martin, et al.
(1992) Science 255:192-194); tubulin epitope peptide (Skinner, et al. (1991) J. Biol. Chem.
266:15163-15166); and the T7 gene 10 protein peptide tag (Lutz-Freyermuth, et al. (1990)
Proc. Nat'l Acad. Sci. USA 87:6393-6397).

Also included are other bladder cancer proteins of the bladder cancer family, and 
bladder cancer proteins from other organisms, which are cloned and expressed as outlined 
below. Thus, probe or degenerate polymerase chain reaction (PCR) primer sequences may be
used to find other related bladder cancer proteins from humans or other organisms. As will
be appreciated by those in the art, particularly useful probe and/or PCR primer sequences
include the unique areas of the bladder cancer nucleic acid sequence. Preferred PCR primers
are from about 15-35 nucleotides in length, with from about 20-30 being preferred, and may
contain inosine as needed. The conditions for the PCR reaction are well known. See, e.g.,
Innis (1990) PCR Protocols, supra.
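
The primer length guidance above can be expressed as a simple check. The following Python sketch is illustrative only: the function name classify_primer, the use of hard cutoffs (the text says "about"), and the use of the letter I to denote inosine are assumptions that do not appear in the original disclosure.

```python
def classify_primer(seq: str) -> str:
    """Classify a candidate PCR primer by the length ranges quoted in the text.

    Assumptions (not from the source): hard cutoffs at 15-35 and 20-30
    nucleotides, and the single-letter code 'I' for inosine.
    """
    allowed = set("ACGTI")
    seq = seq.upper()
    if not seq or set(seq) - allowed:
        raise ValueError("primer may contain only A, C, G, T or I")
    n = len(seq)
    if 20 <= n <= 30:
        return "preferred"        # about 20-30 nucleotides
    if 15 <= n <= 35:
        return "acceptable"       # about 15-35 nucleotides
    return "outside range"


if __name__ == "__main__":
    # Hypothetical sequences used only to exercise the function.
    for primer in ("ACGT" * 5, "ACGTI" * 3, "A" * 10):
        print(len(primer), classify_primer(primer))
```

In practice primer selection also depends on melting temperature, degeneracy, and sequence uniqueness, none of which this sketch attempts to model.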

Antibodies to bladder cancer proteins 

In a preferred embodiment, when the bladder cancer protein is to be used to generate 
antibodies, e.g., for immunotherapy or immunodiagnosis, the bladder cancer protein should 
share at least one epitope or determinant with the full-length protein. By "epitope" or
"determinant" herein is typically meant a portion of a protein which will generate and/or bind
an antibody or T-cell receptor in the context of MHC. Thus, in most instances, antibodies
made to a smaller bladder cancer protein will be able to bind to the full-length protein,
particularly linear epitopes. In a preferred embodiment, the epitope is unique; that is,
antibodies generated to a unique epitope show little or no cross-reactivity.

Methods of preparing polyclonal antibodies are known (see, e.g., Coligan, supra; and 
Harlow and Lane, supra). Polyclonal antibodies can be raised in a mammal, e.g., by one or 
more injections of an immunizing agent and, if desired, an adjuvant. Typically, the 
immunizing agent and/or adjuvant will be injected in the mammal by multiple subcutaneous 
or intraperitoneal injections. The immunizing agent may include a protein encoded by a
nucleic acid of the tables or fragment thereof or a fusion protein thereof. It may be useful to
conjugate the immunizing agent to a protein known to be immunogenic in the mammal being
immunized. Examples of such immunogenic proteins include but are not limited to keyhole
limpet hemocyanin, serum albumin, bovine thyroglobulin, and soybean trypsin inhibitor.
Examples of adjuvants which may be employed include Freund's complete adjuvant and
MPL-TDM adjuvant (monophosphoryl Lipid A, synthetic trehalose dicorynomycolate). The 
immunization protocol may be selected as appropriate. 

The antibodies may be monoclonal antibodies. Monoclonal antibodies may be 
prepared using hybridoma methods, such as those described by Kohler and Milstein (1975)
Nature 256:495-497. In a hybridoma method, a mouse, hamster, or other appropriate host
animal is typically immunized with an immunizing agent to elicit lymphocytes that produce
or are capable of producing antibodies that will specifically bind to the immunizing agent.
Alternatively, the lymphocytes may be immunized in vitro. The immunizing agent will
typically include a polypeptide encoded by a nucleic acid of Tables 1A-13 or fragment
thereof, or a fusion protein thereof. Generally, either peripheral blood lymphocytes ("PBLs")
are used if cells of human origin are desired, or spleen cells or lymph node cells are used if
non-human mammalian sources are desired. The lymphocytes are then fused with an
immortalized cell line using a suitable fusing agent, such as polyethylene glycol, to form a
hybridoma cell (pp. 59-103, Goding (1986) Monoclonal Antibodies: Principles and Practice
Academic Press). Immortalized cell lines are usually transformed mammalian cells,
particularly myeloma cells of rodent, bovine and human origin. Usually, rat or mouse
myeloma cell lines are employed. The hybridoma cells may be cultured in a suitable culture
medium that preferably contains one or more substances that inhibit the growth or survival of
the unfused, immortalized cells. For example, if the parental cells lack the enzyme
hypoxanthine guanine phosphoribosyl transferase (HGPRT or HPRT), the culture medium
for the hybridomas typically will include hypoxanthine, aminopterin, and thymidine ("HAT
medium"), which substances prevent the growth of HGPRT-deficient cells.

In one embodiment, the antibodies are bispecific antibodies. Bispecific antibodies are 
typically monoclonal, preferably human or humanized, antibodies that have binding
specificities for at least two different antigens or that have binding specificities for two
epitopes on the same antigen. In one embodiment, one of the binding specificities is for a
protein encoded by a nucleic acid of Tables 1A-13 or a fragment thereof, the other one is for
another antigen, and preferably for a cell-surface protein or receptor or receptor subunit,
preferably one that is tumor specific. Alternatively, tetramer-type technology may create
multivalent reagents.

In a preferred embodiment, the antibodies to bladder cancer protein are capable of 
reducing or eliminating a biological function of a bladder cancer protein, as is described 
below. That is, the addition of anti-bladder cancer protein antibodies (either polyclonal or 
preferably monoclonal) to bladder cancer tissue (or cells containing bladder cancer) may
reduce or eliminate the bladder cancer. Generally, at least about 25% decrease in activity,
growth, size, or the like is preferred, with at least about 50% being particularly preferred, and
about a 95-100% decrease being especially preferred.

In a preferred embodiment the antibodies to the bladder cancer proteins are 
humanized antibodies (e.g., Xenerex Biosciences; Medarex, Inc.; Abgenix, Inc.; Protein
Design Labs, Inc.). Humanized forms of non-human (e.g., murine) antibodies are chimeric
molecules of immunoglobulins, immunoglobulin chains or fragments thereof (such as Fv,
Fab, Fab', F(ab')2 or other antigen-binding subsequences of antibodies) which contain
minimal sequence derived from non-human immunoglobulin. Humanized antibodies include
human immunoglobulins (recipient antibody) in which residues from a complementarity
determining region (CDR) of the recipient are replaced by residues from a CDR of a
non-human species (donor antibody) such as mouse, rat, or rabbit having the desired specificity,
affinity and capacity. In some instances, Fv framework residues of the human
immunoglobulin are replaced by corresponding non-human residues. Humanized antibodies
may also comprise residues which are found neither in the recipient antibody nor in the
imported CDR or framework sequences. In general, a humanized antibody will comprise
substantially all of at least one, and typically two, variable domains, in which all or
substantially all of the CDR regions correspond to those of a non-human immunoglobulin
and all or substantially all of the framework (FR) regions are those of a human
immunoglobulin consensus sequence. The humanized antibody optimally also will comprise
at least a portion of an immunoglobulin constant region (Fc), typically that of a human
immunoglobulin. See Jones, et al. (1986) Nature 321:522-525; Riechmann, et al. (1988)
Nature 332:323-327; and Presta (1992) Curr. Op. Struct. Biol. 2:593-596. Humanization can
be performed, e.g., following the method of Winter and co-workers (see Jones, et al. (1986)
Nature 321:522-525; Riechmann, et al. (1988) Nature 332:323-327; Verhoeyen, et al. (1988)
Science 239:1534-1536), by substituting rodent CDRs or CDR sequences for the
corresponding sequences of a human antibody. Accordingly, such humanized antibodies are
chimeric antibodies (U.S. Patent No. 4,816,567), wherein substantially less than an intact
human variable domain has been substituted by the corresponding sequence from a
non-human species.

Human antibodies can also be produced using various techniques known in the art,
including phage display libraries (Hoogenboom and Winter (1991) J. Mol. Biol. 227:381-388;
Marks, et al. (1991) J. Mol. Biol. 222:581-597) or techniques for the preparation of human
monoclonal antibodies (e.g., p. 77, Cole, et al. in Reisfeld and Sell (1985) Monoclonal
Antibodies and Cancer Therapy Liss; and Boerner, et al. (1991) J. Immunol. 147:86-95).
Similarly, human antibodies can be made by introducing human immunoglobulin loci into
transgenic animals, e.g., mice in which the endogenous immunoglobulin genes have been
partially or completely inactivated. Upon challenge, human antibody production is observed,
which closely resembles that seen in humans in all respects, including gene rearrangement,
assembly, and antibody repertoire. This approach is described, e.g., in U.S. Patent Nos.
5,545,807; 5,545,806; 5,569,825; 5,625,126; 5,633,425; 5,661,016, and Marks, et al. (1992)
Bio/Technology 10:779-783; Lonberg, et al. (1994) Nature 368:856-859; Morrison (1994)
Nature 368:812-13; Neuberger (1996) Nature Biotechnology 14:826 commenting on
Fishwild, et al. (1996) Nature Biotechnology 14:845-51; and Lonberg and Huszar (1995)
Intern. Rev. Immunol. 13:65-93.

By immunotherapy is meant treatment of bladder cancer with an antibody raised
against bladder cancer proteins. As used herein, immunotherapy can be passive or active.
Passive immunotherapy as defined herein is the passive transfer of antibody to a recipient
(patient), which may be used to target a label or toxin. Active immunization is the induction
of antibody and/or T-cell responses in a recipient (patient). Induction of an immune response
is the result of providing the recipient with an antigen to which antibodies are raised. As
appreciated by one of ordinary skill in the art, the antigen may be provided by injecting a
polypeptide against which antibodies are desired to be raised into a recipient, or contacting 
the recipient with a nucleic acid capable of expressing the antigen and under conditions for 
expression of the antigen, leading to an immune response.

In a preferred embodiment the bladder cancer proteins against which antibodies are
raised are secreted proteins as described above. Without being bound by theory, antibodies
used for treatment bind and prevent the secreted protein from binding to its receptor, thereby
inactivating the secreted bladder cancer protein.

In another preferred embodiment, the bladder cancer protein to which antibodies are
raised is a transmembrane protein. Without being bound by theory, antibodies used for
treatment bind the extracellular domain of the bladder cancer protein and prevent it from
binding to other proteins, such as circulating ligands or cell-associated molecules. The
antibody may cause down-regulation of the transmembrane bladder cancer protein. As will
be appreciated by one of ordinary skill in the art, the antibody may be a competitive,
non-competitive or uncompetitive inhibitor of protein binding to the extracellular domain of the
bladder cancer protein. The antibody is also an antagonist of the bladder cancer protein.
Further, the antibody prevents activation of the transmembrane bladder cancer protein. In
one aspect, when the antibody prevents the binding of other molecules to the bladder cancer
protein, the antibody prevents growth of the cell. The antibody may also be used to target or
sensitize the cell to cytotoxic agents, including, but not limited to, TNF-α, TNF-β, IL-1, IFN-γ
and IL-2, or chemotherapeutic agents including 5FU, vinblastine, actinomycin D, cisplatin,
methotrexate, and the like. In some instances the antibody belongs to a sub-type that
activates serum complement when complexed with the transmembrane protein thereby
mediating cytotoxicity or antibody-dependent cytotoxicity (ADCC). Thus, bladder cancer is
treated by administering to a patient antibodies directed against the transmembrane bladder
cancer protein. Antibody-labeling may activate a co-toxin, localize a toxin payload, or
otherwise provide means to locally ablate cells.

In another preferred embodiment, the antibody is conjugated to an effector moiety. 
The effector moiety can be a number of molecules, including labelling moieties such as
radioactive labels or fluorescent labels, or can be a therapeutic moiety. In one aspect the
therapeutic moiety is a small molecule that modulates the activity of the bladder cancer
protein. In another aspect the therapeutic moiety modulates the activity of molecules
associated with or in close proximity to the bladder cancer protein. The therapeutic moiety
may inhibit enzymatic activity such as protease or collagenase or protein kinase activity
associated with bladder cancer.

In a preferred embodiment, the therapeutic moiety can also be a cytotoxic agent. In
this method, targeting the cytotoxic agent to bladder cancer tissue or cells results in a
reduction in the number of afflicted cells, thereby reducing symptoms associated with bladder
cancer. Cytotoxic agents are numerous and varied and include, but are not limited to,
cytotoxic drugs or toxins or active fragments of such toxins. Suitable toxins and their
corresponding fragments include diphtheria A chain, exotoxin A chain, ricin A chain, abrin A
chain, curcin, crotin, phenomycin, enomycin, and the like. Cytotoxic agents also include
radiochemicals made by conjugating radioisotopes to antibodies raised against bladder cancer
proteins, or binding of a radionuclide to a chelating agent that has been covalently attached to
the antibody. Targeting the therapeutic moiety to transmembrane bladder cancer proteins not
only serves to increase the local concentration of therapeutic moiety in the bladder cancer
afflicted area, but also serves to reduce deleterious side effects that may be associated with
the therapeutic moiety.

In another preferred embodiment, the bladder cancer protein against which the
antibodies are raised is an intracellular protein. In this case, the antibody may be conjugated
to a protein which facilitates entry into the cell. In one case, the antibody enters the cell by
endocytosis. In another embodiment, a nucleic acid encoding the antibody is administered to
the individual or cell. Moreover, wherein the bladder cancer protein can be targeted within a
cell, e.g., the nucleus, an antibody thereto contains a signal for that target localization, e.g., a
nuclear localization signal.

The bladder cancer antibodies of the invention specifically bind to bladder cancer 
proteins. By "specifically bind" herein is meant that the antibodies bind to the protein with a
Kd of at least about 0.1 mM, more usually at least about 1 μM, preferably at least about 0.1
μM or better, and most preferably, 0.01 μM or better. Selectivity of binding is also
important.
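
The binding thresholds above can be read as tiers of a dissociation constant, where a smaller value means tighter binding. The following Python sketch rests on that reading, which is an assumption, as are the function name binding_tier, the tier labels, and the treatment of "about" as a hard cutoff.

```python
# Upper limits in molar units, most stringent first.
# Labels are illustrative and do not come from the source text.
KD_TIERS = (
    (1e-8, "most preferred"),  # 0.01 uM or better
    (1e-7, "preferred"),       # 0.1 uM or better
    (1e-6, "more usual"),      # 1 uM or better
    (1e-4, "specific"),        # 0.1 mM or better
)


def binding_tier(kd_molar: float) -> str:
    """Return the most stringent tier that a dissociation constant satisfies."""
    if kd_molar <= 0:
        raise ValueError("dissociation constant must be positive")
    for limit, label in KD_TIERS:
        if kd_molar <= limit:
            return label
    return "not specific"


if __name__ == "__main__":
    # Hypothetical measurements, in mol/L.
    for kd in (5e-9, 5e-8, 5e-7, 5e-5, 1e-3):
        print(f"{kd:.0e} M -> {binding_tier(kd)}")
```

The sketch addresses affinity only; selectivity of binding, which the text also calls important, would need a separate comparison against non-target proteins.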

Detection of bladder cancer sequence for diagnostic and therapeutic applications

In one aspect, the RNA expression levels of genes are determined for different 
cellular states in the bladder cancer phenotype. Expression levels of genes in normal tissue 
(e.g., not experiencing bladder cancer) and in bladder cancer tissue (and in some cases, for
varying severities of bladder cancer that relate to prognosis, as outlined below), or in
non-malignant disease, are evaluated to provide expression profiles. An expression profile of a
particular cell state or point of development is essentially a "fingerprint" of the state. While
two states may have a particular gene similarly expressed, the evaluation of a number of
genes simultaneously allows the generation of a gene expression profile that is reflective of
the state of the cell. By comparing expression profiles of cells in different states, information
regarding which genes are important (including both up- and down-regulation of genes) in
each of these states is obtained. Then, diagnosis may be performed or confirmed to
determine whether a tissue sample has the gene expression profile of normal or cancerous
tissue. This will provide for molecular diagnosis of related conditions.

"Differential expression," or grammatical equivalents as used herein, refers to 
qualitative or quantitative differences in the temporal and/or cellular gene expression 
patterns within and among cells and tissue. Thus, a differentially expressed gene can 
qualitatively have its expression altered, including an activation or inactivation, in, e.g.,
normal versus bladder cancer tissue. Genes may be turned on or turned off in a particular
state, relative to another state thus permitting comparison of two or more states. A
qualitatively regulated gene will exhibit an expression pattern within a state or cell type
which is detectable by standard techniques. Some genes will be expressed in one state or cell
type, but not in both. Alternatively, the difference in expression may be quantitative, e.g., in
that expression is increased or decreased; e.g., gene expression is either upregulated, resulting
in an increased amount of transcript, or downregulated, resulting in a decreased amount of
transcript. The degree to which expression differs need only be large enough to quantify via
standard characterization techniques as outlined below, such as by use of Affymetrix
GeneChip™ expression arrays. See Lockhart (1996) Nature Biotechnology 14:1675-1680.
Other techniques include, but are not limited to, quantitative reverse transcriptase PCR,
northern analysis and RNase protection. As outlined above, preferably the change in
expression (e.g., upregulation or downregulation) is at least about 50%, more preferably at
least about 100%, more preferably at least about 150%, more preferably at least about 200%,
with from about 300-1000% being especially preferred.
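
The percentage thresholds above can be illustrated with a short calculation. The following Python sketch is one possible reading, not a method stated in the text: it assumes that the change is measured symmetrically as a fold difference (so that a 4-fold decrease counts as a 300% change), and the function name expression_change and the threshold tuple are hypothetical.

```python
from typing import Optional, Tuple

# Percent-change thresholds quoted in the text, most stringent first.
THRESHOLDS = (300, 200, 150, 100, 50)


def expression_change(reference: float, sample: float) -> Tuple[str, float, Optional[int]]:
    """Return (direction, percent change, highest threshold met or None).

    Assumption: the change is computed from the fold difference in either
    direction, i.e. percent = (fold - 1) * 100 with fold >= 1.
    """
    if reference <= 0 or sample <= 0:
        raise ValueError("expression levels must be positive")
    direction = "up" if sample >= reference else "down"
    fold = max(sample / reference, reference / sample)
    percent = (fold - 1.0) * 100.0
    met = next((t for t in THRESHOLDS if percent >= t), None)
    return direction, percent, met


if __name__ == "__main__":
    # Hypothetical signal intensities (normal tissue, tumor tissue).
    print(expression_change(100, 250))
    print(expression_change(400, 100))
```

Other conventions, such as a log ratio, would give different numbers for the same data, so the convention should be fixed before thresholds are applied.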

Evaluation may be at the gene transcript or the protein level. The amount of gene
expression may be monitored using nucleic acid probes to the DNA or RNA equivalent of the
gene transcript, and the quantification of gene expression levels, or, alternatively, the final
gene product itself (protein) can be monitored, e.g., with antibodies to the bladder cancer
protein and standard immunoassays (ELISAs, etc.) or other techniques, including mass
spectroscopy assays, 2D gel electrophoresis assays, etc. Proteins corresponding to bladder
cancer genes, e.g., those identified as being important in a bladder cancer or disease
phenotype, can be evaluated in a bladder cancer diagnostic test.

In a preferred embodiment, gene expression monitoring is performed simultaneously 
on a number of genes. Multiple protein expression monitoring can be performed as well.
Similarly, these assays may be performed on an individual basis as well.

In this embodiment, the bladder cancer nucleic acid probes are attached to biochips as 
outlined herein for the detection and quantification of bladder cancer sequences in a 
particular cell. The assays are further described below in the example. PCR techniques can 
be used to provide greater sensitivity. 

In a preferred embodiment nucleic acids encoding the bladder cancer protein are
detected. Although DNA or RNA encoding the bladder cancer protein may be detected, of
particular interest are methods wherein an mRNA encoding a bladder cancer protein is
detected. Probes to detect mRNA can be a nucleotide/deoxynucleotide probe that is
complementary to and hybridizes with the mRNA and includes, but is not limited to,
oligonucleotides, cDNA or RNA. Probes also should contain a detectable label, as defined
herein. In one method the mRNA is detected after immobilizing the nucleic acid to be
examined on a solid support such as nylon membranes and hybridizing the probe with the
sample. Following washing to remove the non-specifically bound probe, the label is
detected. In another method detection of the mRNA is performed in situ. In this method
permeabilized cells or tissue samples are contacted with a detectably labeled nucleic acid
probe for sufficient time to allow the probe to hybridize with the target mRNA. Following
washing to remove the non-specifically bound probe, the label is detected. For example, a
digoxigenin labeled riboprobe (RNA probe) that is complementary to the mRNA encoding a
bladder cancer protein is detected by binding the digoxigenin with an anti-digoxigenin
secondary antibody and developed with nitro blue tetrazolium and 5-bromo-4-chloro-3-indolyl
phosphate.

In a preferred embodiment, various proteins from the three classes of proteins as 
described herein (secreted, transmembrane or intracellular proteins) are used in diagnostic 
assays. The bladder cancer proteins, antibodies, nucleic acids, modified proteins and cells 
containing bladder cancer sequences are used in diagnostic assays. This can be performed on
an individual gene or corresponding polypeptide level. In a preferred embodiment, the 
expression profiles are used, preferably in conjunction with high throughput screening 
techniques to allow monitoring for expression profile genes and/or corresponding 
polypeptides. 

As described and defined herein, bladder cancer proteins, including intracellular,
transmembrane or secreted proteins, find use as diagnostic or prognostic markers of bladder
cancer, or to assist in selecting therapy based on expression profile and archival data.
Detection of these proteins in putative bladder cancer tissue allows for detection or diagnosis
of bladder cancer. In one embodiment, antibodies are used to detect bladder cancer proteins.
A preferred method separates proteins from a sample by electrophoresis on a gel (typically a
denaturing and reducing protein gel, but may be another type of gel, including isoelectric
focusing gels and the like). Following separation of proteins, the bladder cancer protein is
detected, e.g., by immunoblotting with antibodies raised against the bladder cancer protein.
Methods of immunoblotting are well known to those of ordinary skill in the art.

In another preferred method, antibodies to the bladder cancer protein find use in in
situ imaging techniques, e.g., in histology (e.g., Asai (ed. 1993) "Antibodies in Cell Biology"
Methods in Cell Biology (vol. 37)). In this method cells are contacted with from one to many
antibodies to the bladder cancer protein(s). Following washing to remove non-specific
antibody binding, the presence of the antibody or antibodies is detected. In one embodiment
the antibody is detected by incubating with a secondary antibody that contains a detectable
label. In another method the primary antibody to the bladder cancer protein(s) contains a
detectable label, e.g., an enzyme marker that can act on a substrate. In another preferred
embodiment each one of multiple primary antibodies contains a distinct and detectable label.
This method finds particular use in simultaneous screening for a plurality of bladder cancer
proteins. As will be appreciated by one of ordinary skill in the art, many other histological
imaging techniques are also provided by the invention.

In a preferred embodiment the label is detected in a fluorometer which has the ability 
to detect and distinguish emissions of different wavelengths. In addition, a fluorescence 
activated cell sorter (FACS) can be used in the method. 

In another preferred embodiment, antibodies find use in diagnosing bladder cancer
from blood, serum, plasma, stool, urine, and other samples. Such samples, therefore, are
useful as samples to be probed or tested for the presence of bladder cancer proteins.
Antibodies can be used to detect a bladder cancer protein by previously described
immunoassay techniques including ELISA, immunoblotting (western blotting),
immunoprecipitation, BIACORE technology and the like. Conversely, the presence of
antibodies may indicate an immune response against an endogenous bladder cancer protein.

In a preferred embodiment, in situ hybridization of labeled bladder cancer nucleic 
acid probes to tissue arrays is done. For example, arrays of tissue samples, including bladder 
cancer tissue and/or normal tissue, are made. In situ hybridization (see, e.g., Ausubel, supra)
is then performed. When comparing the fingerprints between an individual and a standard,
the skilled artisan can make a diagnosis, a prognosis, or a prediction based on the findings. It 
is further understood that the genes which indicate the diagnosis may differ from those which 
indicate the prognosis and molecular profiling of the condition of the cells may lead to 
distinctions between responsive or refractory conditions or may be predictive of outcomes. 

In a preferred embodiment, the bladder cancer proteins, antibodies, nucleic acids,
modified proteins and cells containing bladder cancer sequences are used in prognosis assays.
As above, gene expression profiles can be generated that correlate to bladder cancer, clinical,
pathological, or other information, e.g., in terms of long term prognosis. Again, this may be
done on either a protein or gene level, with the use of genes being preferred. Single or
multiple genes may be useful in various combinations. As above, bladder cancer probes may
be attached to biochips for the detection and quantification of bladder cancer sequences in a
tissue or patient. The assays proceed as outlined above for diagnosis. PCR methods may
provide more sensitive and accurate quantification.

Assays for therapeutic compounds

In a preferred embodiment members of the proteins, nucleic acids, and antibodies as 
described herein are used in drug screening assays. The bladder cancer proteins, antibodies, 
nucleic acids, modified proteins and cells containing bladder cancer sequences are used in 
drug screening assays or by evaluating the effect of drug candidates on a "gene expression
profile" or expression profile of polypeptides. In a preferred embodiment, the expression
profiles are used, preferably in conjunction with high throughput screening techniques to
allow monitoring for expression profile genes after treatment with a candidate agent. See,
e.g., Zlokarnik, et al. (1998) Science 279:84-88; and Heid (1996) Genome Res. 6:986-94.

In a preferred embodiment, the bladder cancer proteins, antibodies, nucleic acids,
modified proteins and cells containing the native or modified bladder cancer proteins are used
in screening assays. That is, the present invention provides novel methods for screening for
compositions which modulate the bladder cancer phenotype or an identified physiological
function of a bladder cancer protein. As above, this can be done on an individual gene level
or by evaluating the effect of drug candidates on a "gene expression profile". In a preferred
embodiment, the expression profiles are used, preferably in conjunction with high throughput
screening techniques to allow monitoring for expression profile genes after treatment with a
candidate agent, see Zlokarnik, supra.

Having identified the differentially expressed genes herein, a variety of assays may be 
executed. In a preferred embodiment, assays may be run on an individual gene or protein
level. That is, having identified a particular gene as upregulated in bladder cancer, test
compounds can be screened for the ability to modulate gene expression or for binding to the
bladder cancer protein. "Modulation" thus includes both an increase and a decrease in gene
expression. The preferred amount of modulation will depend on the original change of the
gene expression in normal versus tissue undergoing bladder cancer, with changes of at least
about 10%, preferably about 50%, more preferably about 100-300%, and in some
embodiments about 300-1000% or greater. Thus, if a gene exhibits about 4-fold increase in
bladder cancer tissue compared to normal tissue, a decrease of about four-fold is often
desired; similarly, about 10-fold decrease in bladder cancer tissue compared to normal tissue
often provides a target value of about 10-fold increase in expression to be induced by the test
compound.
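
The relationship between the observed change and the desired modulation can be sketched as follows. This Python example is a minimal illustration of the arithmetic in the preceding paragraph; the function name target_modulation and the input values are hypothetical, and the sketch assumes that the goal is simply to reverse the observed fold change.

```python
from typing import Tuple


def target_modulation(normal: float, tumor: float) -> Tuple[str, float]:
    """Return the direction and fold change a test compound would need to induce
    in order to bring tumor expression back to the normal level.
    """
    if normal <= 0 or tumor <= 0:
        raise ValueError("expression levels must be positive")
    if tumor >= normal:
        # Gene is upregulated in tumor tissue, so a decrease is desired.
        return "decrease", tumor / normal
    # Gene is downregulated in tumor tissue, so an increase is desired.
    return "increase", normal / tumor


if __name__ == "__main__":
    print(target_modulation(1.0, 4.0))    # 4-fold increase in tumor tissue
    print(target_modulation(10.0, 1.0))   # 10-fold decrease in tumor tissue
```

As the text notes, the target is "often" the full reversal; partial modulation may still be of interest, which this sketch does not capture.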

The amount of gene expression may be monitored using nucleic acid probes and the 
quantification of gene expression levels, or, alternatively, the gene product itself can be 
monitored, e.g., through the use of antibodies to the bladder cancer protein and standard 
immunoassays. Proteomics and separation techniques may also allow quantification of
expression.

In a preferred embodiment, gene expression or protein levels of a number of
entities, e.g., an expression profile, are monitored simultaneously. Such profiles will typically
involve a plurality of those entities described herein.

In this embodiment, the bladder cancer nucleic acid probes are attached to biochips as
outlined herein for the detection and quantification of bladder cancer sequences in a
particular cell. Alternatively, PCR may be used. Thus, a series, e.g., of microtiter plates, may
be used with dispensed primers in desired wells. A PCR reaction can then be performed and
analyzed for each well.

Expression monitoring can be performed to identify compounds that modify the
expression of one or more bladder cancer-associated sequences, e.g., a polynucleotide
sequence set out in Tables 1A-13. Generally, in a preferred embodiment, a test modulator is
added to the cells prior to analysis. Moreover, screens are also provided to identify agents
that modulate bladder cancer, modulate bladder cancer proteins, bind to a bladder cancer
protein, or interfere with the binding of a bladder cancer protein and an antibody or other
binding partner.

The term "test compound" or "drug candidate" or "modulator" or grammatical
equivalents as used herein describes a molecule, e.g., protein, oligopeptide, small organic
molecule, polysaccharide, polynucleotide, etc., to be tested for the capacity to directly or
indirectly alter the bladder cancer phenotype or the expression of a bladder cancer sequence,
e.g., a nucleic acid or protein sequence. In preferred embodiments, modulators alter
expression profiles, or expression profile nucleic acids or proteins provided herein. In one
embodiment, the modulator suppresses a bladder cancer phenotype, e.g., to a normal tissue or
non-malignant fingerprint. In another embodiment, a modulator induces a bladder cancer
phenotype. Generally, a plurality of assay mixtures are run in parallel with different agent
concentrations to obtain a differential response to the various concentrations. Typically, one
of these concentrations serves as a negative control, e.g., at zero concentration or below the
level of detection.
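
By way of illustration only, the parallel assay mixtures described above might be summarized by expressing the signal at each agent concentration relative to the zero-concentration negative control. The following Python sketch assumes replicate readings in arbitrary units; the function name, the data layout, and the example values are hypothetical and are not taken from this disclosure.

```python
from statistics import mean


def relative_response(readings_by_conc):
    """Express the mean signal at each concentration as a fraction of the
    mean signal of the zero-concentration (negative control) mixture."""
    if 0.0 not in readings_by_conc:
        raise ValueError("a zero-concentration negative control is required")
    control = mean(readings_by_conc[0.0])
    if control == 0:
        raise ValueError("control signal must be non-zero")
    return {conc: mean(values) / control
            for conc, values in sorted(readings_by_conc.items())}


# Hypothetical readings keyed by agent concentration (micromolar).
example_readings = {
    0.0: [100.0, 100.0],   # negative control
    1.0: [80.0, 80.0],
    10.0: [50.0, 50.0],
}
example_response = relative_response(example_readings)
```

A response that falls or rises monotonically with concentration is one possible indication of a differential response of the kind referred to above.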

Drug candidates encompass numerous chemical classes, though typically they are
organic molecules, preferably small organic compounds having a molecular weight of more
than about 100 and less than about 2,500 daltons. Preferred small molecules are less than
about 2000, or less than about 1500 or less than about 1000 or less than about 500 D.
Candidate agents comprise functional groups necessary for structural interaction with
proteins, particularly hydrogen bonding, and typically include at least an amine, carbonyl,
hydroxyl, or carboxyl group, preferably at least two of the functional chemical groups. The
candidate agents often comprise cyclical carbon or heterocyclic structures and/or aromatic or
polyaromatic structures substituted with one or more of the above functional groups.
Candidate agents are also found among biomolecules including peptides, saccharides, fatty
acids, steroids, purines, pyrimidines, derivatives, structural analogs or combinations thereof.
Particularly preferred are peptides.
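
As a non-limiting sketch, the molecular weight window and functional group preference stated above could be applied as a simple filter over a candidate list. The record layout, the function name, and the example candidates below are assumptions made solely for illustration.

```python
# Functional groups named in the text as typical of candidate agents.
FUNCTIONAL_GROUPS = {"amine", "carbonyl", "hydroxyl", "carboxyl"}


def is_preferred_small_molecule(mol_weight, groups, max_weight=2500.0):
    """Return True when the weight is above 100 and below max_weight daltons
    and at least two of the named functional groups are present."""
    n_groups = len(FUNCTIONAL_GROUPS & set(groups))
    return 100.0 < mol_weight < max_weight and n_groups >= 2


# Hypothetical candidates: (label, molecular weight in daltons, groups).
candidates = [
    ("cand_1", 350.0, ["amine", "hydroxyl"]),
    ("cand_2", 80.0, ["amine", "carboxyl"]),    # too light
    ("cand_3", 420.0, ["hydroxyl"]),            # only one group
    ("cand_4", 3000.0, ["amine", "carbonyl"]),  # too heavy
]
selected = [label for label, mw, groups in candidates
            if is_preferred_small_molecule(mw, groups)]
```

The `max_weight` parameter may be lowered, e.g., to 2000, 1500, 1000 or 500, to reflect the narrower preferences recited above.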

In one aspect, a modulator will neutralize the effect of a bladder cancer protein. By
"neutralize" is meant that the activity of a protein is inhibited or blocked, as is the
consequent effect on the cell.

In certain embodiments, combinatorial libraries of potential modulators will be
screened for an ability to bind to a bladder cancer polypeptide or to modulate activity.
Conventionally, new chemical entities with useful properties are generated by identifying a
chemical compound (called a "lead compound") with some desirable property or activity,
e.g., inhibiting activity, creating variants of the lead compound, and evaluating the property
and activity of those variant compounds. Often, high throughput screening (HTS) methods
are employed for such an analysis.

In one preferred embodiment, high throughput screening methods involve providing a
library containing a large number of potential therapeutic compounds (candidate
compounds). Such "combinatorial chemical libraries" are then screened in one or more
assays to identify those library members (particular chemical species or subclasses) that
display a desired characteristic activity. The compounds thus identified can serve as
conventional "lead compounds" or can themselves be used as potential or actual therapeutics.

A combinatorial chemical library is a collection of diverse chemical compounds 
generated by either chemical synthesis or biological synthesis by combining a number of 
chemical "building blocks" such as reagents. For example, a linear combinatorial chemical
library, such as a polypeptide (e.g., mutein) library, is formed by combining a set of chemical
building blocks called amino acids in every possible way for a given compound length (e.g., 
the number of amino acids in a polypeptide compound). Millions of chemical compounds 
can be synthesized through such combinatorial mixing of chemical building blocks. See, e.g., 
Gallop, et al. (1994) J. Med. Chem. 37:1233-1251. 
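
The combinatorial mixing described above can be sketched, for illustration only, as an enumeration of every ordered combination of building blocks for a given length. The sketch below assumes the 20 standard amino acids as the building blocks; the names `AMINO_ACIDS`, `library_size` and `enumerate_library` are hypothetical.

```python
from itertools import product

# One-letter codes of the 20 standard amino acids (assumed building blocks).
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def library_size(length, alphabet=AMINO_ACIDS):
    """Number of distinct linear compounds of the given length."""
    return len(alphabet) ** length


def enumerate_library(length, alphabet=AMINO_ACIDS):
    """Lazily yield every linear combination of building blocks."""
    for combo in product(alphabet, repeat=length):
        yield "".join(combo)


# A two-letter alphabet keeps the illustration small.
tiny_library = list(enumerate_library(2, alphabet="AG"))
pentapeptide_count = library_size(5)
```

With 20 building blocks, a length of only five residues already yields 20^5 = 3,200,000 compounds, consistent with the statement that millions of compounds can be produced in this way.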

Preparation and screening of combinatorial chemical libraries is well known to those
of skill in the art. Such combinatorial chemical libraries include, but are not limited to,
peptide libraries (see, e.g., U.S. Patent No. 5,010,175, Furka (1991) Pept. Prot. Res.
37:487-493, Houghton, et al. (1991) Nature 354:84-88); peptoids (PCT Publication No. WO
91/19735); encoded peptides (PCT Publication WO 93/20242); random bio-oligomers (PCT
Publication WO 92/00091); benzodiazepines (U.S. Pat. No. 5,288,514); diversomers such as
hydantoins, benzodiazepines, and dipeptides (Hobbs, et al. (1993) Proc. Nat'l Acad. Sci. USA
90:6909-6913); vinylogous polypeptides (Hagihara, et al. (1992) J. Amer. Chem. Soc.
114:6568-6570); nonpeptidal peptidomimetics with a Beta-D-Glucose scaffolding
(Hirschmann, et al. (1992) J. Amer. Chem. Soc. 114:9217-9218); analogous organic
syntheses of small compound libraries (Chen, et al. (1994) J. Amer. Chem. Soc.
116:2661-2662); oligocarbamates (Cho, et al. (1993) Science 261:1303-1305); and/or peptidyl
phosphonates (Campbell, et al. (1994) J. Org. Chem. 59:658-xxx). See, generally, Gordon, et
al. (1994) J. Med. Chem. 37:1385-1401. Further examples include nucleic acid libraries
(see, e.g., Stratagene, Corp.);
peptide nucleic acid libraries (see, e.g., U.S. Patent 5,539,083); antibody libraries (see, e.g.,
Vaughn, et al. (1996) Nature Biotechnology 14:309-314, and PCT/US96/10287);
carbohydrate libraries (Liang, et al. (1996) Science 274:1520-1522, and U.S. Patent No.
5,593,853); and small organic molecule libraries (see, e.g., benzodiazepines, Baum (p. 33,
Jan 18, 1993) C&E News); isoprenoids (U.S. Patent No. 5,569,588); thiazolidinones and
metathiazanones (U.S. Patent No. 5,549,974); pyrrolidines (U.S. Patent Nos. 5,525,735 and
5,519,134); morpholino compounds (U.S. Patent No. 5,506,337); benzodiazepines (U.S.
Patent No. 5,288,514); and the like.

Devices for the preparation of combinatorial libraries are commercially available. 
See, e.g., 357 MPS, 390 MPS, Advanced Chem Tech, Louisville KY; Symphony, Rainin, 
Woburn, MA; 433A Applied Biosystems, Foster City, CA; and 9050 Plus, Millipore,
Bedford, MA.

A number of well known robotic systems have also been developed for solution phase
chemistries. These systems include automated workstations like the automated synthesis
apparatus developed by Takeda Chemical Industries, LTD. (Osaka, Japan) and many robotic
systems utilizing robotic arms (Zymate II, Zymark Corporation, Hopkinton, MA; Orca,
Hewlett-Packard, Palo Alto, CA), which mimic the manual synthetic operations performed by
a chemist. The above devices are suitable for use with the present invention. The nature and
implementation of modifications to these devices (if any) so that they can operate as
discussed herein will be apparent to persons skilled in the relevant art. In addition, numerous
combinatorial libraries are themselves commercially available (see, e.g., ComGenex,
Princeton, NJ; Asinex, Moscow, RU; Tripos, Inc., St. Louis, MO; ChemStar, Ltd, Moscow,
RU; 3D Pharmaceuticals, Exton, PA; Martek Biosciences, Columbia, MD, etc.).

The assays to identify modulators are amenable to high throughput screening. 
Preferred assays thus detect enhancement or inhibition of bladder cancer gene transcription, 
inhibition or enhancement of polypeptide expression, and inhibition or enhancement of
polypeptide activity.

High throughput assays for the presence, absence, quantification, or other properties
of particular nucleic acids or protein products are well known to those of skill in the art.
Binding assays and reporter gene assays are similarly well known. Thus, e.g., U.S.
Patent No. 5,559,410 discloses high throughput screening methods for proteins, U.S. Patent
No. 5,585,639 discloses high throughput screening methods for nucleic acid binding (e.g., in
arrays), while U.S. Patent Nos. 5,576,220 and 5,541,061 disclose high throughput methods of
screening for ligand/antibody binding.

In addition, high throughput screening systems are commercially available (see, e.g.,
Zymark Corp., Hopkinton, MA; Air Technical Industries, Mentor, OH; Beckman
Instruments, Inc., Fullerton, CA; Precision Systems, Inc., Natick, MA; etc.). These systems
typically automate entire procedures, including all sample and reagent pipetting, liquid
dispensing, timed incubations, and final readings of the microplate in detector(s) appropriate
for the assay. These configurable systems provide high throughput and rapid start up as well
as a high degree of flexibility and customization. The manufacturers of such systems provide
detailed protocols for various high throughput systems. Thus, e.g., Zymark Corp. provides
technical bulletins describing screening systems for detecting the modulation of gene
transcription, ligand binding, and the like.

In one embodiment, modulators are proteins, often naturally occurring proteins or
fragments of naturally occurring proteins. Thus, e.g., cellular extracts containing proteins, or
random or directed digests of proteinaceous cellular extracts, may be used. In this way
libraries of proteins may be made for screening in the methods of the invention. Particularly
preferred in this embodiment are libraries of bacterial, fungal, viral, and mammalian proteins,
with the latter being preferred, and human proteins being especially preferred. Particularly
useful test compounds will be directed to the class of proteins to which the target belongs, e.g.,
substrates for enzymes or ligands and receptors.

In a preferred embodiment, modulators are peptides of from about 5-30 amino acids,
with from about 5-20 amino acids being preferred, and from about 7-15 being particularly
preferred. The peptides may be digests of naturally occurring proteins as is outlined above,
random peptides, or "biased" random peptides. By "randomized" or grammatical equivalents
herein is meant that each nucleic acid and peptide consists of essentially random nucleotides
and amino acids, respectively. Since generally these random peptides (or nucleic acids,
discussed below) are chemically synthesized, they may incorporate nucleotide or amino acid
substitutions. The synthetic process can be designed to generate randomized proteins or
nucleic acids, to allow the formation of all or most of the possible combinations over the
length of the sequence, thus forming a library of randomized candidate bioactive
proteinaceous agents.

In one embodiment, the library is fully randomized, with no sequence preferences or
constants. In a preferred embodiment, the library is biased. That is, some positions within
the sequence are either held constant, or are selected from a limited number of possibilities.
For example, in a preferred embodiment, the nucleotides or amino acid residues are
randomized within a defined class, e.g., of hydrophobic amino acids, hydrophilic residues,
sterically biased (either small or large) residues, towards the creation of nucleic acid binding
domains, the creation of cysteines for cross-linking, prolines for SH-3 domains, serines,
threonines, tyrosines or histidines for phosphorylation sites, etc., or to purines, etc.
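
A biased library of the kind just described might be sketched as follows, purely by way of example. Each position of a template is either held constant (a single allowed residue) or drawn at random from a limited class. The template, the residue classes, and the function name are illustrative assumptions; in particular, the membership of the `HYDROPHOBIC` class shown here is one common convention and is not defined by this disclosure.

```python
import random

ALL_RESIDUES = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard amino acids
HYDROPHOBIC = "AVLIMFW"                # illustrative class only


def biased_peptide(template, rng):
    """Build one peptide; template[i] is the string of residues allowed at
    position i (a one-character string holds that position constant)."""
    return "".join(rng.choice(allowed) for allowed in template)


# Constant N-terminal Cys, one hydrophobic position, constant C-terminal Pro.
template = ["C", ALL_RESIDUES, HYDROPHOBIC, ALL_RESIDUES, "P"]
rng = random.Random(0)  # seeded so the sketch is repeatable
biased_library = [biased_peptide(template, rng) for _ in range(5)]
```

A fully randomized library corresponds to the special case in which every template position allows every residue.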

Modulators of bladder cancer can also be nucleic acids, as defined above. 
As described above generally for proteins, nucleic acid modulating agents may be 
naturally occurring nucleic acids, random nucleic acids, or "biased" random nucleic acids. 
Digests of procaryotic or eucaryotic genomes may be used as is outlined above for proteins. 

In a preferred embodiment, the candidate compounds are organic chemical moieties, a
wide variety of which are available in the literature.

After the candidate agent has been added and the cells allowed to incubate for some 
period of time, the sample containing a target sequence to be analyzed is added to the 
biochip. If required, the target sequence is prepared using known techniques. For example,
the sample may be treated to lyse the cells, using known lysis buffers, electroporation, etc.,
with purification and/or amplification such as PCR performed as appropriate. For example, 
an in vitro transcription with labels covalently attached to the nucleotides is performed. 
Generally, the nucleic acids are labeled with biotin-FITC or PE, or with cy3 or cy5. 

In a preferred embodiment, the target sequence is labeled with, e.g., a fluorescent, a
chemiluminescent, a chemical, or a radioactive signal, to provide a means of detecting the
target sequence's specific binding to a probe. The label also can be an enzyme, such as
alkaline phosphatase or horseradish peroxidase, which when provided with an appropriate
substrate produces a product that can be detected. Alternatively, the label can be a labeled
compound or small molecule, such as an enzyme inhibitor, that binds but is not catalyzed or
altered by the enzyme. The label also can be a moiety or compound, such as an epitope tag
or biotin which specifically binds to streptavidin. For the example of biotin, the streptavidin
is labeled as described above, thereby providing a detectable signal for the bound target
sequence. Unbound labeled streptavidin is typically removed prior to analysis.

As will be appreciated by those in the art, these assays can be direct hybridization
assays or can comprise "sandwich assays", which include the use of multiple probes, as is
generally outlined in U.S. Patent Nos. 5,681,702, 5,597,909, 5,545,730, 5,594,117,
5,591,584, 5,571,670, 5,580,731, 5,571,670, 5,591,584, 5,624,802, 5,635,352, 5,594,118,
5,359,100, 5,124,246 and 5,681,697, all of which are hereby incorporated by reference. In
this embodiment, in general, the target nucleic acid is prepared as outlined above, and then
added to the biochip comprising a plurality of nucleic acid probes, under conditions that
allow the formation of a hybridization complex.

A variety of hybridization conditions may be used in the present invention, including
high, moderate and low stringency conditions as outlined above. The assays are generally
run under stringency conditions which allow formation of the label probe hybridization
complex only in the presence of target. Stringency can be controlled by altering a step
parameter that is a thermodynamic variable, including, but not limited to, temperature,
formamide concentration, salt concentration, chaotropic salt concentration, pH, organic
solvent concentration, etc.

These parameters may also be used to control non-specific binding, as is generally
outlined in U.S. Patent No. 5,681,697. Thus it may be desirable to perform certain steps at
higher stringency conditions to reduce non-specific binding.

The reactions outlined herein may be accomplished in a variety of ways. Components
of the reaction may be added simultaneously, or sequentially, in different orders, with
preferred embodiments outlined below. In addition, the reaction may include a variety of
other reagents. These include salts, buffers, neutral proteins, e.g., albumin, detergents, etc.,
which may be used to facilitate optimal hybridization and detection, and/or reduce
non-specific or background interactions. Reagents that otherwise improve the efficiency of the
assay, such as protease inhibitors, nuclease inhibitors, anti-microbial agents, etc., may also be
used as appropriate, depending on the sample preparation methods and purity of the target.

The assay data are analyzed to determine the expression levels, and changes in
expression levels as between states, of individual genes, forming a gene expression profile.
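
One hedged way to picture this analysis step is to compute, for each gene measured in two states, the log2 ratio of its expression levels. The sketch below is illustrative only: the function name, the pseudocount used to avoid division by zero, and the gene labels and values are all assumptions and do not come from the Tables.

```python
import math


def expression_changes(state_a, state_b, pseudocount=1.0):
    """Return the log2 ratio (state_b over state_a) for every gene measured
    in both states; positive values indicate higher expression in state_b."""
    shared = state_a.keys() & state_b.keys()
    return {
        gene: math.log2((state_b[gene] + pseudocount)
                        / (state_a[gene] + pseudocount))
        for gene in sorted(shared)
    }


# Hypothetical expression levels in arbitrary units.
normal_levels = {"geneA": 7.0, "geneB": 15.0, "geneC": 4.0}
tumor_levels = {"geneA": 31.0, "geneB": 3.0}
profile = expression_changes(normal_levels, tumor_levels)
```

Genes measured in only one state (here the hypothetical `geneC`) are omitted from the resulting profile rather than assigned a guessed value.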

Screens are performed to identify modulators of the bladder cancer phenotype. In one
embodiment, screening is performed to identify modulators that can induce or suppress a
particular expression profile, thus preferably generating the associated phenotype. In another
embodiment, e.g., for diagnostic applications, having identified differentially expressed genes
important in a particular state, screens can be performed to identify modulators that alter
expression of individual genes. In another embodiment, screening is performed to identify
modulators that alter a biological function of the expression product of a differentially
expressed gene. Again, having identified the importance of a gene in a particular state,
screens are performed to identify agents that bind and/or modulate the biological activity of
the gene product.

In addition, screens can be done for genes that are induced in response to a candidate
agent. After identifying a modulator based upon its ability to suppress a bladder cancer
expression pattern leading to a normal expression pattern, or to modulate a single bladder
cancer gene expression profile so as to mimic the expression of the gene from normal tissue,
a screen as described above can be performed to identify genes that are specifically
modulated in response to the agent. Comparing expression profiles between normal tissue
and agent treated bladder cancer tissue reveals genes that are not expressed in normal tissue
or bladder cancer tissue, but are expressed in agent treated tissue. These agent-specific
sequences can be identified and used by methods described herein for bladder cancer genes or
proteins. In particular these sequences and the proteins they encode find use in marking or
identifying agent treated cells. In addition, antibodies can be raised against the agent induced
proteins and used to target novel therapeutics to the treated bladder cancer tissue sample.

Thus, in one embodiment, a test compound is administered to a population of bladder
cancer cells that have an associated bladder cancer expression profile. By "administration"
or "contacting" herein is meant that the candidate agent is added to the cells in such a manner
as to allow the agent to act upon the cell, whether by uptake and intracellular action, or by
action at the cell surface. In some embodiments, nucleic acid encoding a proteinaceous
candidate agent (e.g., a peptide) may be put into a viral construct such as an adenoviral or
retroviral construct, and added to the cell, such that expression of the peptide agent is
accomplished; see, e.g., PCT US97/01019. Regulatable gene therapy systems can also be used.

Once the test compound has been administered to the cells, the cells can be washed if 
desired and are allowed to incubate under preferably physiological conditions for some
period of time. The cells are then harvested and a new gene expression profile is generated,
as outlined herein. 

Thus, e.g., bladder cancer or non-malignant tissue may be screened for agents that
modulate, e.g., induce or suppress, the bladder cancer phenotype. A change in at least one
gene, preferably many, of the expression profile indicates that the agent has an effect on
bladder cancer activity. By defining such a signature for the bladder cancer phenotype,
screens for new drugs that alter the phenotype can be devised. With this approach, the drug
target need not be known and need not be represented in the original expression screening
platform, nor does the level of transcript for the target protein need to change.

In a preferred embodiment, as outlined above, screens may be done on individual
genes and gene products (proteins). That is, having identified a particular differentially
expressed gene as important in a particular state, screening of modulators of either the
expression of the gene or the gene product itself can be done. The gene products of
differentially expressed genes are sometimes referred to herein as "bladder cancer proteins"
or a "bladder cancer modulatory protein". The bladder cancer modulatory protein may be a
fragment, or alternatively, be the full length protein to the fragment encoded by the nucleic
acids of Tables 1A-13. Preferably, the bladder cancer modulatory protein is a fragment.
In a preferred embodiment, the bladder cancer amino acid sequence which is used to
determine sequence identity or similarity is encoded by a nucleic acid of Tables 1A-13. In
another embodiment, the sequences are naturally occurring allelic variants of a protein
encoded by a nucleic acid of Tables 1A-13. In another embodiment, the sequences are
sequence variants as further described herein.

Preferably, the bladder cancer modulatory protein is a fragment of approximately 14 
to 24 amino acids long. More preferably the fragment is a soluble fragment. Preferably, the 
fragment includes a non-transmembrane region. In a preferred embodiment, the fragment has
an N-terminal Cys to aid in solubility. In one embodiment, the C-terminus of the fragment is
kept as a free acid and the N-terminus is a free amine to aid in coupling, e.g., to cysteine.

In one embodiment the bladder cancer proteins are conjugated to an immunogenic 
agent as discussed herein. In one embodiment the bladder cancer protein is conjugated to
BSA.

Measurements of bladder cancer polypeptide activity, or of bladder cancer or the
bladder cancer phenotype can be performed using a variety of assays. For example, the
effects of the test compounds upon the function of the bladder cancer polypeptides can be
measured by examining parameters described above. A suitable physiological change that
affects activity can be used to assess the influence of a test compound on the polypeptides of
this invention. When the functional consequences are determined using intact cells or
animals, one can also measure a variety of effects such as, in the case of bladder cancer
associated with tumors, tumor growth, tumor metastasis, neovascularization, hormone
release, transcriptional changes to both known and uncharacterized genetic markers (e.g.,
northern blots), changes in cell metabolism such as cell growth or pH changes, and changes
in intracellular second messengers such as cGMP. In the assays of the invention, mammalian
bladder cancer polypeptide is typically used, e.g., mouse, preferably human.

Assays to identify compounds with modulating activity can be performed in vitro.
For example, a bladder cancer polypeptide is first contacted with a potential modulator and
incubated for a suitable amount of time, e.g., from 0.5-48 hours. In one embodiment, the
bladder cancer polypeptide levels are determined in vitro by measuring the level of protein or
mRNA. The level of protein is measured using immunoassays such as western blotting,
ELISA and the like with an antibody that selectively binds to the bladder cancer polypeptide
or a fragment thereof. For measurement of mRNA, amplification, e.g., using PCR, LCR, or
hybridization assays, e.g., northern hybridization, RNase protection, dot blotting, are
preferred. The level of protein or mRNA is detected using directly or indirectly labeled
detection agents, e.g., fluorescently or radioactively labeled nucleic acids, radioactively or
enzymatically labeled antibodies, and the like, as described herein.

Alternatively, a reporter gene system can be devised using the bladder cancer protein
promoter operably linked to a reporter gene such as luciferase, green fluorescent protein,
CAT, or β-gal. The reporter construct is typically transfected into a cell. After treatment
with a potential modulator, the amount of reporter gene transcription, translation, or activity
is measured according to standard techniques known to those of skill in the art.

In a preferred embodiment, as outlined above, screens may be done on individual
genes and gene products (proteins). That is, having identified a particular differentially
expressed gene as important in a particular state, screening of modulators of the expression of
the gene or the gene product itself can be done. The gene products of differentially expressed
genes are sometimes referred to herein as "bladder cancer proteins." The bladder cancer
protein may be a fragment, or alternatively, be the full length protein to a fragment shown
herein.

In one embodiment, screening for modulators of expression of specific genes is
performed. Typically, the expression of only one or a few genes is evaluated. In another
embodiment, screens are designed to first find compounds that bind to differentially
expressed proteins. These compounds are then evaluated for the ability to modulate
differentially expressed activity. Moreover, once initial candidate compounds are identified,
variants can be further screened to better evaluate structure activity relationships.

In a preferred embodiment, binding assays are done. In general, purified or isolated 
gene product is used; that is, the gene products of one or more differentially expressed 
nucleic acids are made. For example, antibodies are generated to the protein gene products,
and standard immunoassays are run to determine the amount of protein present.

Alternatively, cells comprising the bladder cancer proteins can be used in the assays. 

Thus, in a preferred embodiment, the methods comprise combining a bladder cancer 
protein and a candidate compound, and determining the binding of the compound to the 
bladder cancer protein. Preferred embodiments utilize the human bladder cancer protein,
although other mammalian proteins may also be used, e.g., for the development of animal
models of human disease. In some embodiments, as outlined herein, variant or derivative 
bladder cancer proteins may be used. 

Generally, in a preferred embodiment of the methods herein, the bladder cancer
protein or the candidate agent is non-diffusably bound to an insoluble support having isolated
sample receiving areas (e.g., a microtiter plate, an array, etc.). The insoluble supports may be
made of any composition to which the compositions can be bound, that is readily separated from
soluble material, and that is otherwise compatible with the overall method of screening. The
surface of such supports may be solid or porous and of a convenient shape. Examples of
suitable insoluble supports include microtiter plates, arrays, membranes and beads. These are
typically made of glass, plastic (e.g., polystyrene), polysaccharides, nylon or nitrocellulose,
teflon™, etc. Microtiter plates and arrays are especially convenient because a large number
of assays can be carried out simultaneously, using small amounts of reagents and samples.
The particular manner of binding of the composition is not crucial so long as it is compatible
with the reagents and overall methods of the invention, maintains the activity of the
composition and is nondiffusable. Preferred methods of binding include the use of antibodies
(which do not sterically block either the ligand binding site or activation sequence when the
protein is bound to the support), direct binding to "sticky" or ionic supports, chemical
crosslinking, the synthesis of the protein or agent on the surface, etc. Following binding of
the protein or agent, excess unbound material is removed by washing. The sample receiving
areas may then be blocked through incubation with bovine serum albumin (BSA), casein or
other innocuous protein or other moiety.

In a preferred embodiment, the bladder cancer protein is bound to the support, and a
test compound is added to the assay. Alternatively, the candidate agent is bound to the
support and the bladder cancer protein is added. Novel binding agents include specific
antibodies, non-natural binding agents identified in screens of chemical libraries, peptide
analogs, etc. Of particular interest are screening assays for agents that have a low toxicity for
human cells. A wide variety of assays may be used for this purpose, including labeled in
vitro protein-protein binding assays, electrophoretic mobility shift assays, immunoassays for
protein binding, functional assays (phosphorylation assays, etc.) and the like.

The determination of the binding of the test modulating compound to the bladder
cancer protein may be done in a number of ways. In a preferred embodiment, the compound
is labeled, and binding determined directly, e.g., by attaching all or a portion of the bladder
cancer protein to a solid support, adding a labeled candidate agent (e.g., a fluorescent label),
washing off excess reagent, and determining whether the label is present on the solid support.
Various blocking and washing steps may be utilized as appropriate.




WO 03/003906 



PCT/US02/21338



In some embodiments, only one of the components is labeled, e.g., the proteins (or
proteinaceous candidate compounds) can be labeled. Alternatively, more than one
component can be labeled with different labels, e.g., 125I for the proteins and a fluorophore
for the compound. Proximity reagents, e.g., quenching or energy transfer reagents, are also
useful.

In one embodiment, the binding of the test compound is determined by competitive
binding assay. The competitor is a binding moiety known to bind to the target molecule (e.g.,
a bladder cancer protein), such as an antibody, peptide, binding partner, ligand, etc. Under
certain circumstances, there may be competitive binding between the compound and the
binding moiety, with the binding moiety displacing the compound. In one embodiment, the
test compound is labeled. Either the compound, or the competitor, or both, is added first to
the protein for a time sufficient to allow binding, if present. Incubations may be performed at
a temperature which facilitates optimal activity, typically between 4 and 40°C. Incubation
periods are typically optimized, e.g., to facilitate rapid high throughput screening. Typically
between 0.1 and 1 hour will be sufficient. Excess reagent is generally removed or washed
away. The second component is then added, and the presence or absence of the labeled
component is followed, to indicate binding.

In a preferred embodiment, the competitor is added first, followed by the test 
compound. Displacement of the competitor is an indication that the test compound is binding
to the bladder cancer protein and thus is capable of binding to, and potentially modulating,
the activity of the bladder cancer protein. In this embodiment, either component can be 
labeled. Thus, e.g., if the competitor is labeled, the presence of label in the wash solution 
indicates displacement by the agent. Alternatively, if the test compound is labeled, the 
presence of the label on the support indicates displacement. 

25 In an alternative embodiment, the test compound is added first, with incubation and 

washing, followed by the competitor. The absence of binding by the competitor may indicate 
that the test compound is bound to the bladder cancer protein with a higher affinity. Thus, if 
the test compound is labeled, the presence of the label on the support, coupled with a lack of 
competitor binding, may indicate that the test compound is capable of binding to the bladder 

30 cancer protein. 
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In a preferred embodiment, the methods comprise differential screening to identity 
agents that are capable of modulating the activity of the bladder cancer proteins. In this 
embodiment, the methods comprise combining a bladder cancer protein and a competitor in a 
first sample. A second sample comprises a test compound, a bladder cancer protein, and a 
5 competitor. The binding of the competitor is determined for both samples, and a change, or 
difference in binding between the two samples indicates the presence of an agent capable of 
binding to the bladder cancer protein and potentially modulating its activity. That is, if the 
binding of the competitor is different in the second sample relative to the first sample, the 
agent is capable of binding to the bladder cancer protein. 
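
By way of illustration only, the comparison of competitor binding between the first and second samples might be computed as in the following Python sketch. The function names, the replicate readings, and the 20% cutoff are hypothetical and are not taken from this disclosure; any suitable measure of difference in binding could be substituted.

```python
from statistics import mean

def percent_change_in_competitor_binding(first_sample, second_sample):
    """Return the percent change in mean bound competitor signal.

    first_sample  -- replicate signals (e.g., counts) without test compound
    second_sample -- replicate signals with test compound present
    A negative value means less competitor is bound in the second sample.
    """
    baseline = mean(first_sample)
    if baseline == 0:
        raise ValueError("first sample shows no competitor binding")
    return 100.0 * (mean(second_sample) - baseline) / baseline

def is_candidate_binder(first_sample, second_sample, threshold_pct=20.0):
    """Flag a test compound when binding differs by at least threshold_pct.

    The threshold is an arbitrary illustrative cutoff, not a value
    prescribed by the assay description.
    """
    change = percent_change_in_competitor_binding(first_sample, second_sample)
    return abs(change) >= threshold_pct

# Hypothetical triplicate readings (arbitrary units)
without_compound = [1000.0, 1040.0, 960.0]
with_compound = [510.0, 490.0, 500.0]
change = percent_change_in_competitor_binding(without_compound, with_compound)
```

In this sketch a change in either direction is flagged, consistent with the statement that a change or difference in binding between the two samples indicates a candidate agent.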

Alternatively, differential screening is used to identify drug candidates that bind to the
native bladder cancer protein, but cannot bind to modified bladder cancer proteins. The
structure of the bladder cancer protein may be modeled, and used in rational drug design to
synthesize agents that interact with that site. Drug candidates that affect the activity of a
bladder cancer protein are also identified by screening drugs for the ability to either enhance
or reduce the activity of the protein.

Positive controls and negative controls may be used in the assays. Preferably control 
and test samples are performed in at least triplicate to obtain statistically significant results. 
Incubation of all samples is for a time sufficient for the binding of the agent to the protein. 
Following incubation, samples are washed free of non-specifically bound material and the
amount of bound, generally labeled, agent is determined. For example, where a radiolabel is
employed, the samples may be counted in a scintillation counter to determine the amount of 
bound compound. 

A variety of other reagents may be included in the screening assays. These include 
reagents like salts, neutral proteins, e.g., albumin, detergents, etc., which may be used to
facilitate optimal protein-protein binding and/or reduce non-specific or background
interactions. Also reagents that otherwise improve the efficiency of the assay, such as 
protease inhibitors, nuclease inhibitors, anti-microbial agents, etc., may be used. The mixture 
of components may be added in an order that provides for the requisite binding. 

In a preferred embodiment, the invention provides methods for screening for a 
compound capable of modulating the activity of a bladder cancer protein. The methods
comprise adding a test compound, as defined above, to a cell comprising bladder cancer
proteins. Many different cell types may be transfected to contain a recombinant nucleic acid
that encodes a bladder cancer protein. In a preferred embodiment, a library of candidate
agents is tested on a plurality of cells.

In one aspect, the assays are evaluated in the presence or absence or previous or
subsequent exposure to physiological signals, e.g., hormones, antibodies, peptides, antigens,
cytokines, growth factors, action potentials, and pharmacological agents including, e.g.,
chemotherapeutics, radiation, carcinogens, or other cells (e.g., cell-cell contacts). In one
example, the determinations are made at different stages of the cell cycle process.

In this way, compounds that modulate bladder cancer agents are identified.
Compounds with pharmacological activity are able to enhance or interfere with the activity of
the bladder cancer protein. Once identified, similar structures are evaluated to identify
critical structural features of the compound.

In one embodiment, a method of inhibiting bladder cancer cell division is provided.
The method comprises administration of a bladder cancer inhibitor. In another embodiment,
a method of inhibiting bladder cancer is provided. The method comprises administration of a
bladder cancer inhibitor. In a further embodiment, methods of treating cells or individuals
with bladder cancer are provided. The method comprises administration of a bladder cancer
inhibitor. In one embodiment, a bladder cancer inhibitor is an antibody as discussed above.
In another embodiment, the bladder cancer inhibitor is an antisense molecule.

A variety of cell growth, proliferation, and metastasis assays are known to those of 
skill in the art, as described below.

Soft agar growth or colony formation in suspension

Normal cells require a solid substrate to attach and grow. When the cells are
transformed, they lose this phenotype and grow detached from the substrate. For example,
transformed cells can grow in stirred suspension culture or suspended in semi-solid media,
such as semi-solid or soft agar. The transformed cells, when transfected with tumor
suppressor genes, regenerate a normal phenotype and require a solid substrate to attach and
grow. Soft agar growth or colony formation in suspension assays can be used to identify
modulators of bladder cancer sequences, which when expressed in host cells, inhibit
abnormal cellular proliferation and transformation. A therapeutic compound would reduce or
eliminate the host cells' ability to grow in stirred suspension culture or suspended in
semi-solid media, such as semi-solid or soft agar.

Techniques for soft agar growth or colony formation in suspension assays are
described in Freshney (1994) Culture of Animal Cells: A Manual of Basic Technique (3d ed.)
Wiley-Liss, herein incorporated by reference. See also, the methods section of Garkavtsev, et
al. (1996), supra, herein incorporated by reference.

Contact inhibition and density limitation of growth

Normal cells typically grow in a flat and organized pattern in a petri dish until they
touch other cells. When the cells touch one another, they are contact inhibited and stop
growing. When cells are transformed, however, the cells are not contact inhibited and
continue to grow to high densities in disorganized foci. Thus, the transformed cells grow to a
higher saturation density than normal cells. This can be detected morphologically by the
formation of a disoriented monolayer of cells or rounded cells in foci within the regular
pattern of normal surrounding cells. Alternatively, labeling index with (3H)-thymidine at
saturation density can be used to measure density limitation of growth. See Freshney (1994),
supra. The transformed cells, when transfected with tumor suppressor genes, regenerate a
normal phenotype and become contact inhibited and would grow to a lower density.

In this assay, labeling index with (3H)-thymidine at saturation density is a preferred
method of measuring density limitation of growth. Transformed host cells are transfected
with a bladder cancer-associated sequence and are grown for 24 hours at saturation density in
non-limiting medium conditions. The percentage of cells labeling with (3H)-thymidine is
determined autoradiographically. See, Freshney (1994), supra.
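
As a minimal sketch of the arithmetic involved, the labeling index may be expressed as the percentage of scored cells that are labeled. The function name and the cell counts below are hypothetical and are given only to illustrate the calculation.

```python
def labeling_index(labeled_cells, total_cells):
    """Percentage of scored cells that incorporated label."""
    if total_cells <= 0:
        raise ValueError("total_cells must be positive")
    if not 0 <= labeled_cells <= total_cells:
        raise ValueError("labeled_cells must lie between 0 and total_cells")
    return 100.0 * labeled_cells / total_cells

# Hypothetical autoradiography counts at saturation density
control_index = labeling_index(labeled_cells=180, total_cells=600)     # transformed host cells
transfected_index = labeling_index(labeled_cells=45, total_cells=600)  # after transfection
```

A lower index in the transfected cells than in the control would be consistent with restored density limitation of growth.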

Growth factor or serum dependence 
Transformed cells have a lower serum dependence than their normal counterparts
(see, e.g., Temin (1966) J. Natl Cancer Inst. 37:167-175; Eagle, et al. (1970) J. Exp. Med.
131:836-879; Freshney (1994), supra). This is in part due to release of various growth factors
by the transformed cells. Growth factor or serum dependence of transformed host cells can
be compared with that of control.

Tumor specific marker levels

Tumor cells release a greater amount of certain factors (hereinafter "tumor
specific markers") than their normal counterparts. For example, plasminogen activator (PA)
is released from human glioma at a higher level than from normal brain cells. See, e.g.,
Gullino "Angiogenesis, tumor vascularization, and potential interference with tumor growth"
pp. 178-184 in Mihich (ed. 1985) Biological Responses in Cancer Plenum. Similarly, tumor
angiogenesis factor (TAF) is released at a higher level in tumor cells than their normal 
counterparts. See, e.g., Folkman (1992) Sem Cancer Biol. 3:89-96. 

Various techniques which measure the release of these factors are described in
Freshney (1994), supra. See also, Unkeless, et al. (1974) J. Biol. Chem. 249:4295-4305;
Strickland and Beers (1976) J. Biol. Chem. 251:5694-5702; Whur, et al. (1980) Br. J. Cancer
42:305-312; Gullino "Angiogenesis, tumor vascularization, and potential interference with
tumor growth" pp. 178-184 in Mihich (ed. 1985) Biological Responses in Cancer Plenum;
and Freshney (1985) Anticancer Res. 5:111-130.

Invasiveness into Matrigel

The degree of invasiveness into Matrigel or some other extracellular matrix
constituent can be used as an assay to identify compounds that modulate bladder
cancer-associated sequences. Tumor cells exhibit a good correlation between malignancy and
invasiveness of cells into Matrigel or some other extracellular matrix constituent. In this
assay, tumorigenic cells are typically used as host cells. Expression of a tumor suppressor
gene in these host cells would decrease invasiveness of the host cells.

Techniques described in Freshney (1984), supra, can be used. Briefly, the level of
invasion of host cells can be measured by using filters coated with Matrigel or some other
extracellular matrix constituent. Penetration into the gel, or through to the distal side of the
filter, is rated as invasiveness, and rated histologically by number of cells and distance
moved, or by prelabeling the cells with 125I and counting the radioactivity on the distal side
of the filter or bottom of the dish. See, e.g., Freshney (2000), supra.

Tumor growth in vivo

Effects of bladder cancer-associated sequences on cell growth can be tested in
transgenic or immune-suppressed mice. Knock-out transgenic mice can be made, in which
the bladder cancer gene is disrupted or in which a bladder cancer gene is inserted. Knock-out
transgenic mice can be made by insertion of a marker gene or other heterologous gene into
the endogenous bladder cancer gene site in the mouse genome via homologous
recombination. Such mice can also be made by substituting the endogenous bladder cancer
gene with a mutated version of the bladder cancer gene, or by mutating the endogenous
bladder cancer gene, e.g., by exposure to carcinogens.

A DNA construct is introduced into the nuclei of embryonic stem cells. Cells 
containing the newly engineered genetic lesion are injected into a host mouse embryo, which 
is re-implanted into a recipient female. Some of these embryos develop into chimeric mice 
that possess germ cells partially derived from the mutant cell line. By breeding the chimeric
mice it is possible to obtain a new line of mice containing the introduced genetic lesion. See,
e.g., Capecchi, et al. (1989) Science 244:1288-1292. Chimeric targeted mice can be made.
See Hogan, et al. (1988) Manipulating the Mouse Embryo: A Laboratory Manual CSH Press; 
and Robertson (ed. 1987) Teratocarcinomas and Embryonic Stem Cells: A Practical 
Approach IRL Press, Washington, D.C. 

Alternatively, various immune-suppressed or immune-deficient host animals can be
used. For example, a genetically athymic "nude" mouse (see, e.g., Giovanella, et al. (1974) J.
Nat'l Cancer Inst. 52:921-930), a SCID mouse, a thymectomized mouse, or an irradiated
mouse (see, e.g., Bradley, et al. (1978) Br. J. Cancer 38:263-272; Selby, et al. (1980) Br. J.
Cancer 41:52-61) can be used as a host. Transplantable tumor cells (typically about 10^
cells) injected into isogenic hosts will produce invasive tumors in a high proportion of cases,
while normal cells of similar origin will not. In hosts which developed invasive tumors, cells
expressing a bladder cancer-associated sequence are injected subcutaneously. After a
suitable length of time, preferably about 4-8 weeks, tumor growth is measured (e.g., by
volume or by its two largest dimensions) and compared to the control. Tumors that have a
statistically significant reduction (using, e.g., Student's t test) are said to have inhibited
growth.
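
The measurement and comparison described above can be sketched as follows. This is an illustration under stated assumptions, not a prescribed protocol: the volume formula V = (L x W^2)/2 is a commonly used approximation that this disclosure does not itself specify, and the function names and caliper measurements are hypothetical.

```python
from math import sqrt
from statistics import mean, variance

def tumor_volume(length_mm, width_mm):
    """Estimate volume from the two largest dimensions.

    Uses the approximation V = (L * W**2) / 2, which is an assumption
    here; the text only says "by volume or by its two largest dimensions".
    """
    longest, shortest = max(length_mm, width_mm), min(length_mm, width_mm)
    return longest * shortest ** 2 / 2.0

def student_t_statistic(group_a, group_b):
    """Two-sample Student's t statistic with pooled variance.

    Returns (t, degrees_of_freedom). Comparing t against a critical value
    or converting it to a p-value is left to a statistics table or library.
    """
    n_a, n_b = len(group_a), len(group_b)
    if n_a < 2 or n_b < 2:
        raise ValueError("each group needs at least two measurements")
    dof = n_a + n_b - 2
    pooled = ((n_a - 1) * variance(group_a) + (n_b - 1) * variance(group_b)) / dof
    if pooled == 0:
        raise ValueError("no variance in either group")
    t = (mean(group_a) - mean(group_b)) / sqrt(pooled * (1.0 / n_a + 1.0 / n_b))
    return t, dof

# Hypothetical caliper measurements (mm) for control and treated animals
control = [tumor_volume(length, width) for length, width in [(12, 10), (14, 10), (13, 10)]]
treated = [tumor_volume(length, width) for length, width in [(8, 6), (9, 6), (10, 6)]]
t_value, dof = student_t_statistic(control, treated)
```

With 4 degrees of freedom, a two-sided test at the 5% level uses a critical value of about 2.776, so a t statistic of larger magnitude would be treated as a statistically significant difference in this sketch.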

Polynucleotide modulators of bladder cancer 

Antisense and RNAi Polynucleotides

In certain embodiments, the activity of a bladder cancer-associated protein is
down-regulated, or entirely inhibited, by the use of an antisense polynucleotide, e.g., a nucleic
acid complementary to, and which can preferably hybridize specifically to, a coding mRNA
nucleic acid sequence, e.g., a bladder cancer protein mRNA, or a subsequence thereof.
Binding of the antisense polynucleotide to the mRNA reduces the translation and/or stability
of the mRNA.

In the context of this invention, antisense polynucleotides can comprise
naturally-occurring nucleotides, or synthetic species formed from naturally-occurring subunits
or their close homologs. Antisense polynucleotides may also have altered sugar moieties or
inter-sugar linkages. Exemplary among these are the phosphorothioate and other sulfur containing
species which are known for use in the art. Analogs are comprehended by this invention so
long as they function effectively to hybridize with the bladder cancer protein mRNA. See,
e.g., Isis Pharmaceuticals, Carlsbad, CA; Sequitor, Inc., Natick, MA.

Such antisense polynucleotides can readily be synthesized using recombinant means,
or can be synthesized in vitro. Equipment for such synthesis is sold by several vendors,
including Applied Biosystems. The preparation of other oligonucleotides such as 
phosphorothioates and alkylated derivatives is also well known. 

Antisense molecules as used herein include antisense or sense oligonucleotides.
Sense oligonucleotides can, e.g., be employed to block transcription by binding to the
anti-sense strand. The antisense and sense oligonucleotides comprise a single-stranded nucleic
acid sequence (either RNA or DNA) capable of binding to target mRNA (sense) or DNA
(antisense) sequences for bladder cancer molecules. A preferred antisense molecule is for a
bladder cancer sequence in Tables 1A-13, or for a ligand or activator thereof. Antisense or
sense oligonucleotides, according to the present invention, comprise a fragment generally at
least about 14 nucleotides, preferably from about 14 to 30 nucleotides. The ability to derive
an antisense or a sense oligonucleotide, based upon a cDNA sequence encoding a given
protein is described in, e.g., Stein and Cohen (1988) Cancer Res. 48:2659-2668; and van der
Krol, et al. (1988) BioTechniques 6:958-976.
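
Purely as an illustration of deriving antisense oligonucleotides from a cDNA sequence, the following Python sketch enumerates candidate antisense DNA oligonucleotides of a chosen length as reverse complements of successive windows of the sense strand. The function names and the 24-nucleotide example sequence are hypothetical; the sequence is made up and is not a sequence from Tables 1A-13. No selection for hybridization specificity or stability is attempted.

```python
COMPLEMENT = {"A": "T", "C": "G", "G": "C", "T": "A"}

def reverse_complement(dna):
    """Return the reverse complement of a DNA sequence (A, C, G, T only)."""
    try:
        return "".join(COMPLEMENT[base] for base in reversed(dna.upper()))
    except KeyError as err:
        raise ValueError(f"unexpected base: {err.args[0]}") from None

def antisense_candidates(cdna, length=20):
    """Yield (start, oligo) pairs of antisense DNA oligos along a cDNA.

    cdna   -- sense-strand cDNA sequence
    length -- oligo length; the text suggests about 14 to 30 nucleotides
    """
    if not 14 <= length <= 30:
        raise ValueError("length outside the 14-30 nucleotide range")
    for start in range(len(cdna) - length + 1):
        yield start, reverse_complement(cdna[start:start + length])

# Made-up 24-nt sequence for illustration only
example_cdna = "ATGGCCAAGTTCGACCTGAGCTAA"
oligos = list(antisense_candidates(example_cdna, length=20))
```

Each yielded oligonucleotide is complementary to the corresponding window of the sense sequence, and so would be expected to pair with an mRNA carrying that window.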

RNA interference is a mechanism to suppress gene expression in a sequence specific
manner. See, e.g., Brummelkamp, et al. (2002) Sciencexpress (21 March 2002); Sharp (1999)
Genes Dev. 13:139-141; and Carthew (2001) Curr. Op. Cell Biol. 13:244-248. In mammalian
cells, short, e.g., 21 nt, double stranded small interfering RNAs (siRNA) have been shown to
be effective at inducing an RNAi response. See, e.g., Elbashir, et al. (2001) Nature
411:494-498. The mechanism may be used to downregulate expression levels of identified genes,
e.g., for treatment of or validation of relevance to disease.

Ribozymes

In addition to antisense polynucleotides, ribozymes can be used to target and inhibit 
transcription of bladder cancer-associated nucleotide sequences. A ribozyme is an RNA 
molecule that catalytically cleaves other RNA molecules. Different kinds of ribozymes have 
been described, including group I ribozymes, hammerhead ribozymes, hairpin ribozymes,
RNase P, and axhead ribozymes. See, e.g., Castanotto, et al. (1994) Adv. in Pharmacology 
25: 289-317 for a general review of the properties of different ribozymes. 

The general features of hairpin ribozymes are described, e.g., in Hampel, et al. (1990) 
Nucl. Acids Res. 18:299-304; European Patent Publication No. 0 360 257; U.S. Patent No. 
5,254,678. Methods of preparing them are well known. See, e.g., WO 94/26877; Ojwang, et
al. (1993) Proc. Nat'l Acad. Sci. USA 90:6340-6344; Yamada, et al. (1994) Human Gene
Therapy 1:39-45; Leavitt, et al. (1995) Proc. Nat'l Acad. Sci. USA 92:699-703; Leavitt, et al.
(1994) Human Gene Therapy 5:1 1 51-1 9Q; and Yamada, et al. (1994) Virology 205:121-126.

Polynucleotide modulators of bladder cancer may be introduced into a cell containing 
the target nucleotide sequence by formation of a conjugate with a ligand binding molecule, as 
described in WO 91/04753. Suitable ligand binding molecules include, but are not limited to, 
cell surface receptors, growth factors, other cytokines, or other ligands that bind to cell 
surface receptors. Preferably, conjugation of the ligand binding molecule does not 
substantially interfere with the ability of the ligand binding molecule to bind to its 
corresponding molecule or receptor, or block entry of the sense or antisense oligonucleotide 
or its conjugated version into the cell. Alternatively, a polynucleotide modulator of bladder 
cancer may be introduced into a cell containing the target nucleic acid sequence, e.g., by 
formation of a polynucleotide-lipid complex, as described in WO 90/10448. It is
understood that antisense molecules or knock out and knock in models may also be
used in screening assays as discussed above, in addition to methods of treatment.

Thus, in one embodiment, methods of modulating bladder cancer in cells or
organisms are provided. In one embodiment, the methods comprise administering to a cell an 
anti-bladder cancer antibody that reduces or eliminates the biological activity of an 
endogenous bladder cancer protein. Alternatively, the methods comprise administering to a 
cell or organism a recombinant nucleic acid encoding a bladder cancer protein. This may be
accomplished in many ways. In a preferred embodiment, e.g., when the bladder cancer
sequence is down-regulated in bladder cancer, such state may be reversed by increasing the
amount of bladder cancer gene product in the cell. This can be accomplished, e.g., by
overexpressing the endogenous bladder cancer gene or administering a gene encoding the
bladder cancer sequence, using known gene-therapy techniques. In a preferred embodiment,
the gene therapy techniques include the incorporation of the exogenous gene using enhanced
homologous recombination (EHR), e.g., as described in PCT/US93/03868, hereby
incorporated by reference in its entirety. Alternatively, e.g., when the bladder cancer
sequence is up-regulated in bladder cancer, the activity of the endogenous bladder cancer
gene is decreased, e.g., by the administration of a bladder cancer antisense nucleic acid.

In one embodiment, the bladder cancer proteins of the present invention may be used
to generate polyclonal and monoclonal antibodies to bladder cancer proteins. Similarly, the
bladder cancer proteins can be coupled, using standard technology, to affinity
chromatography columns. These columns may then be used to purify bladder cancer
antibodies useful for production, diagnostic, or therapeutic purposes. In a preferred
embodiment, the antibodies are generated to epitopes unique to a bladder cancer protein; that
is, the antibodies show little or no cross-reactivity to other proteins. The bladder cancer
antibodies may be coupled to standard affinity chromatography columns and used to purify
bladder cancer proteins. The antibodies may also be used as blocking polypeptides, as
outlined above, since they will specifically bind to the bladder cancer protein.

Methods of identifying variant bladder cancer-associated sequences 

Without being bound by theory, expression of various bladder cancer sequences is
correlated with bladder cancer. Accordingly, disorders based on mutant or variant bladder
cancer genes may be determined. In one embodiment, the invention provides methods for
identifying cells containing variant bladder cancer genes, e.g., determining all or part of the
sequence of at least one endogenous bladder cancer gene in a cell. This may be
accomplished using many sequencing techniques. In a preferred embodiment, the invention 
provides methods of identifying the bladder cancer genotype of an individual, e.g., 
determining all or part of the sequence of at least one bladder cancer gene of the individual. 
This is generally done in at least one tissue of the individual, and may include the evaluation 
of a number of tissues or different samples of the same tissue. The method may include 
comparing the sequence of the sequenced bladder cancer gene to a known bladder cancer 
gene, e.g., a wild-type gene. 

The sequence of all or part of the bladder cancer gene can then be compared to the 
sequence of a known bladder cancer gene to determine if differences exist. This can be done 
using many known homology programs, such as Bestfit, etc. In a preferred embodiment, the
presence of a difference in the sequence between the bladder cancer gene of the patient and
the known bladder cancer gene correlates with a disease state or a propensity for a disease 
state, as outlined herein. 
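
The comparison step can be illustrated with a short Python sketch that lists position-by-position differences between a sequenced gene fragment and a known reference. It assumes, for simplicity, that the two sequences have already been aligned to equal length (with "-" marking gaps) by an alignment program; it does not reproduce the behavior of Bestfit or any other homology program. The function name and both sequences are hypothetical.

```python
def sequence_differences(sample_seq, reference_seq):
    """List (position, reference_base, sample_base) for each mismatch.

    Both sequences are assumed to be pre-aligned and of equal length, with
    '-' marking gaps; positions are 1-based. Producing the alignment itself
    is left to an alignment program.
    """
    if len(sample_seq) != len(reference_seq):
        raise ValueError("sequences must be aligned to the same length")
    pairs = zip(reference_seq.upper(), sample_seq.upper())
    return [
        (pos, ref, obs)
        for pos, (ref, obs) in enumerate(pairs, start=1)
        if ref != obs
    ]

# Made-up aligned fragments for illustration only
reference = "ATGGCCAAGTTCGAC"
patient = "ATGGCCAGGTTC-AC"
differences = sequence_differences(patient, reference)
```

Here the sketch reports a substitution at position 8 and a gap at position 13; whether any such difference correlates with a disease state is a separate determination.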

In a preferred embodiment, the bladder cancer genes are used as probes to determine 
the number of copies of the bladder cancer gene in the genome. 

In another preferred embodiment, the bladder cancer genes are used as probes to 
determine the chromosomal localization of the bladder cancer genes. Information such as 
chromosomal localization finds use in providing a diagnosis or prognosis in particular when 
chromosomal abnormalities such as translocations, and the like are identified in the bladder 
cancer gene locus. 

Administration of pharmaceutical and vaccine compositions 

In one embodiment, a therapeutically effective dose of a bladder cancer protein or 
modulator thereof, is administered to a patient. By "therapeutically effective dose" herein is 
meant a dose that produces the effects for which it is administered. The exact dose will depend
on the purpose of the treatment, and will be ascertainable by one skilled in the art using 
known techniques. See, e.g., Ansel, et al. (1999) Pharmaceutical Dosage Forms and Drug 
Delivery Lippincott; Lieberman (1992) Pharmaceutical Dosage Forms (vols. 1-3) Dekker,
ISBN 0824770846, 082476918X, 0824712692, 0824716981; Lloyd (1999) The Art, Science
and Technology of Pharmaceutical Compounding Amer. Pharma. Assn.; and Pickar (1999)
Dosage Calculations Thomson. As is known in the art, adjustments for bladder cancer
degradation, systemic versus localized delivery, and rate of new protease synthesis, as well as
the age, body weight, general health, sex, diet, time of administration, drug interaction and
the severity of the condition may be necessary, and will be ascertainable with routine
experimentation by those skilled in the art. USSN 09/687,576 further discloses the use of
compositions and methods of diagnosis and treatment in bladder cancer and is hereby 
expressly incorporated by reference. 

A "patient" for the purposes of the present invention includes both humans and other
animals, particularly mammals. Thus the methods are applicable to both human therapy and
veterinary applications. In the preferred embodiment the patient is a mammal, preferably a 
primate, and in the most preferred embodiment the patient is human. 

The administration of the bladder cancer proteins and modulators thereof of the
present invention can be done in a variety of ways as discussed above, including, but not
limited to, orally, subcutaneously, intravenously, intranasally, transdermally, 
intraperitoneally, intramuscularly, intrapulmonary, vaginally, rectally, or intraocularly. In 
some instances, e.g., in the treatment of wounds and inflammation, the bladder cancer 
proteins and modulators may be directly applied as a solution or spray. 

The pharmaceutical compositions of the present invention comprise a bladder cancer
protein in a form suitable for administration to a patient. In the preferred embodiment, the
pharmaceutical compositions are in a water soluble form, such as being present as
pharmaceutically acceptable salts, which is meant to include both acid and base addition
salts. "Pharmaceutically acceptable acid addition salt" refers to those salts that retain the
biological effectiveness of the free bases and that are not biologically or otherwise
undesirable, formed with inorganic acids such as hydrochloric acid, hydrobromic acid,
sulfuric acid, nitric acid, phosphoric acid and the like, and organic acids such as acetic acid,
propionic acid, glycolic acid, pyruvic acid, oxalic acid, maleic acid, malonic acid, succinic
acid, fumaric acid, tartaric acid, citric acid, benzoic acid, cinnamic acid, mandelic acid,
methanesulfonic acid, ethanesulfonic acid, p-toluenesulfonic acid, salicylic acid and the like.
"Pharmaceutically acceptable base addition salts" include those derived from inorganic bases
such as sodium, potassium, lithium, ammonium, calcium, magnesium, iron, zinc, copper, 
manganese, aluminum salts and the like. Particularly preferred are the ammonium, 
potassium, sodium, calcium, and magnesium salts. Salts derived from pharmaceutically 
acceptable organic non-toxic bases include salts of primary, secondary, and tertiary amines, 
substituted amines including naturally occurring substituted amines, cyclic amines and basic 
ion exchange resins, such as isopropylamine, trimethylamine, diethylamine, triethylamine, 
tripropylamine, and ethanolamine. 

The pharmaceutical compositions may also include one or more of the following: 
carrier proteins such as serum albumin; buffers; fillers such as microcrystalline cellulose, 
lactose, corn and other starches; binding agents; sweeteners and other flavoring agents; 
coloring agents; and polyethylene glycol. 

The pharmaceutical compositions can be administered in a variety of unit dosage 
forms depending upon the method of administration. For example, unit dosage forms 
suitable for oral administration include, but are not limited to, powder, tablets, pills, capsules 
and lozenges. It is recognized that bladder cancer protein modulators (e.g., antibodies, 
antisense constructs, ribozymes, small organic molecules, etc.) when administered orally, 
should be protected from digestion. This is typically accomplished either by complexing the 
molecule(s) with a composition to render it resistant to acidic and enzymatic hydrolysis, or by 
packaging the molecule(s) in an appropriately resistant carrier, such as a liposome or a 
protection barrier. Means of protecting agents from digestion are well known in the art. 

The compositions for administration will commonly comprise a bladder cancer 
protein modulator dissolved in a pharmaceutically acceptable carrier, preferably an aqueous 
carrier. A variety of aqueous carriers can be used, e.g., buffered saline and the like. These 
solutions are sterile and generally free of undesirable matter. These compositions may be 
sterilized by conventional, well known sterilization techniques. The compositions may 
contain pharmaceutically acceptable auxiliary substances as required to approximate 
physiological conditions such as pH adjusting and buffering agents, toxicity adjusting agents 
and the like, e.g., sodium acetate, sodium chloride, potassium chloride, calcium chloride, 
sodium lactate and the like. The concentration of active agent in these formulations can vary
widely, and will be selected primarily based on fluid volumes, viscosities, body weight and
the like in accordance with the particular mode of administration selected and the patient's
needs (e.g., Remington's Pharmaceutical Science (15th ed., 1980) and Hardman and Limbird
(eds. 2001) Goodman and Gilman: The Pharmacological Basis of Therapeutics McGraw-Hill).

Thus, a typical pharmaceutical composition for intravenous administration would be
about 0.1-10 mg per patient per day. Dosages from about 0.1-100 mg per patient per day
may be used, particularly when the drug is administered to a secluded site and not into the
blood stream, such as into a body cavity or into a lumen of an organ. Substantially higher
dosages are possible in topical administration. Actual methods for preparing parenterally
administrable compositions will be known or apparent to those skilled in the art, e.g.,
Remington's Pharmaceutical Science and Goodman and Gilman: The Pharmacological Basis
of Therapeutics, supra.

The compositions containing modulators of bladder cancer proteins can be
administered for therapeutic or prophylactic treatments. In therapeutic applications,
compositions are administered to a patient suffering from a disease (e.g., a cancer) in an
amount sufficient to cure or at least partially arrest the disease and its complications. An
amount adequate to accomplish this is defined as a "therapeutically effective dose." Amounts
effective for this use will depend upon the severity of the disease and the general state of the
patient's health. Single or multiple administrations of the compositions may be given
depending on the dosage and frequency as required and tolerated by the patient. The
composition should provide a sufficient quantity of the agents of this invention to effectively
treat the patient. An amount of modulator that is capable of preventing or slowing the
development of cancer in a mammal is referred to as a "prophylactically effective dose." The
particular dose required for a prophylactic treatment will depend upon the medical condition
and history of the mammal, the particular cancer being prevented, as well as other factors
such as age, weight, gender, administration route, efficiency, etc. Such prophylactic
treatments may be used, e.g., in a mammal who has previously had cancer to prevent a
recurrence of the cancer, or in a mammal who is suspected of having a significant likelihood
of developing cancer based, at least in part, upon gene expression profiles. Vaccine strategies
may be used, in either a DNA vaccine form, or protein vaccine.

It will be appreciated that the present bladder cancer protein-modulating compounds
can be administered alone or in combination with additional bladder cancer modulating
compounds or with other therapeutic agents, e.g., other anti-cancer agents or treatments.

In numerous embodiments, one or more nucleic acids, e.g., polynucleotides 
comprising nucleic acid sequences set forth in Tables 1A-13, such as antisense 
polynucleotides or ribozymes, will be introduced into cells, in vitro or in vivo. The present 
invention provides methods, reagents, vectors, and cells useful for expression of bladder 
cancer-associated polypeptides and nucleic acids using in vitro (cell-free), ex vivo, or in vivo 
(cell or organism-based) recombinant expression systems. 

The particular procedure used to introduce the nucleic acids into a host cell for 
expression of a protein or nucleic acid is application specific. Many procedures for 
introducing foreign nucleotide sequences into host cells may be used. These include the use 
of calcium phosphate transfection, spheroplasts, electroporation, liposomes, microinjection, 
plasmid vectors, viral vectors, and other methods for introducing cloned genomic DNA,
cDNA, synthetic DNA or other foreign genetic material into a host cell. See, e.g., Berger and
Kimmel (1987) Guide to Molecular Cloning Techniques from Methods in Enzymology (vol.
152) Academic Press; Ausubel, et al. (eds. 1999 and supplements) Current Protocols in 
Molecular Biology Lippincott; and Sambrook, et al. (1989) Molecular Cloning: A Laboratory 
Manual (2d ed., Vol. 1-3) CSH Press. 

In a preferred embodiment, bladder cancer proteins and modulators are administered 
as therapeutic agents, and can be formulated as outlined above. Similarly, bladder cancer 
genes (including the full-length sequence, partial sequences, or regulatory sequences of
the bladder cancer coding regions) can be administered in a gene therapy application. These 
bladder cancer genes can include antisense applications, either as gene therapy (e.g., for 
incorporation into the genome) or as antisense compositions, as will be appreciated by those 
in the art. 

Bladder cancer polypeptides and polynucleotides can also be administered as vaccine 
compositions to stimulate HTL, CTL, and antibody responses. Such vaccine compositions
can include, e.g., lipidated peptides (Vitiello, et al. (1995) J. Clin. Invest. 95:341-349);
peptide compositions encapsulated in poly(DL-lactide-co-glycolide) ("PLG") microspheres
(Eldridge, et al. (1991) Molec. Immunol. 28:287-294; Alonso, et al. (1994) Vaccine 12:299-306;
Jones, et al. (1995) Vaccine 13:675-681); peptide compositions contained in immune
stimulating complexes (ISCOMS) (Takahashi, et al. (1990) Nature 344:873-875; Hu, et al.
(1998) Clin. Exp. Immunol. 113:235-243); multiple antigen peptide systems (MAPs) (Tam
(1988) Proc. Nat'l Acad. Sci. USA 85:5409-5413; Tam (1996) J. Immunol. Methods 196:17-32);
peptides formulated as multivalent peptides; peptides for use in ballistic delivery
systems, typically crystallized peptides, viral delivery vectors (Perkus, et al. in Kaufmann
(ed. 1996) Concepts in Vaccine Development de Gruyter; Chakrabarti, et al. (1986) Nature
320:535-537; Hu, et al. (1986) Nature 320:537-547; Kieny, et al. (1986) AIDS
Bio/Technology 4:790; Top, et al. (1971) J. Infect. Dis. 124:148-154; Chanda, et al. (1990)
Virology 175:535-547), particles of viral or synthetic origin (see, e.g., Kofler, et al. (1996) J.
Immunol. Methods 192:25-35; Eldridge, et al. (1993) Sem. Hematol. 30:16-24; Falo, et al.
(1995) Nature Med. 7:649-653), adjuvants (Warren, et al. (1986) Annu. Rev. Immunol.
4:369-388; Gupta, et al. (1993) Vaccine 11:293-306), liposomes (Reddy, et al. (1992) J.
Immunol. 148:1585-1589; Rock (1996) Immunol. Today 17:131-137), or, naked or particle
absorbed cDNA (Ulmer, et al. (1993) Science 259:1745-1749; Robinson, et al. (1993)
Vaccine 11:957-960; Shiver, et al. in Kaufmann (ed. 1996) Concepts in Vaccine
Development de Gruyter; Cease and Berzofsky (1994) Annu. Rev. Immunol. 12:923-989;
and Eldridge, et al. (1993) Sem. Hematol. 30:16-24). Toxin-targeted delivery technologies,
also known as receptor mediated targeting, such as those of Avant Immunotherapeutics, Inc.,
Needham, MA, may also be used.

Vaccine compositions often include adjuvants. Many adjuvants contain a substance 
designed to protect the antigen from rapid catabolism, such as aluminum hydroxide or 
mineral oil, and a stimulator of immune responses, such as lipid A, Bordetella pertussis or
Mycobacterium tuberculosis derived proteins. Certain adjuvants are commercially available
as, e.g., Freund's Incomplete Adjuvant and Complete Adjuvant (Difco Laboratories, Detroit,
MI); Merck Adjuvant 65 (Merck and Company, Inc., Rahway, NJ); AS-2 (SmithKline
Beecham, Philadelphia, PA); aluminum salts such as aluminum hydroxide gel (alum) or
aluminum phosphate; salts of calcium, iron or zinc; an insoluble suspension of acylated
tyrosine; acylated sugars; cationically or anionically derivatized polysaccharides;
polyphosphazenes; biodegradable microspheres; monophosphoryl lipid A and quil A.
Cytokines, such as GM-CSF, interleukin-2, -7, -12, and other like growth factors, may also be
used as adjuvants.

Vaccines can be administered as nucleic acid compositions wherein DNA or RNA
encoding one or more of the polypeptides, or a fragment thereof, is administered to a patient.
This approach is described, for instance, in Wolff, et al. (1990) Science 247:1465-1468 as
well as U.S. Patent Nos. 5,580,859; 5,589,466; 5,804,566; 5,739,118; 5,736,524; 5,679,647;
WO 98/04720; and in more detail below. Examples of DNA-based delivery technologies
include "naked DNA", facilitated (bupivacaine, polymers, peptide-mediated) delivery,
cationic lipid complexes, and particle-mediated ("gene gun") or pressure-mediated delivery
(see, e.g., U.S. Patent No. 5,922,687).

For therapeutic or prophylactic immunization purposes, the peptides of the invention
can be expressed by viral or bacterial vectors. Examples of expression vectors include
attenuated viral hosts, such as vaccinia or fowlpox. This approach involves the use of
vaccinia virus, e.g., as a vector to express nucleotide sequences that encode bladder cancer
polypeptides or polypeptide fragments. Upon introduction into a host, the recombinant
vaccinia virus expresses the immunogenic peptide, and thereby elicits an immune response.
Vaccinia vectors and methods useful in immunization protocols are described in, e.g., U.S.
Patent No. 4,722,848. Another vector is BCG (Bacille Calmette Guerin). BCG vectors are
described in Stover, et al. (1991) Nature 351:456-460. A wide variety of other vectors are useful
for therapeutic administration or immunization, e.g., adeno and adeno-associated virus
vectors, retroviral vectors, Salmonella typhi vectors, detoxified anthrax toxin vectors, and the
like. See, e.g., Shata, et al. (2000) Mol. Med. Today 6:66-71; Shedlock, et al. (2000) J.
Leukoc. Biol. 68:793-806; Hipp, et al. (2000) In Vivo 14:571-85.

Methods for the use of genes as DNA vaccines are well known, and include placing a
bladder cancer gene or portion of a bladder cancer gene under the control of a regulatable
promoter or a tissue-specific promoter for expression in a bladder cancer patient. The
bladder cancer gene used for DNA vaccines can encode full-length bladder cancer proteins,
but more preferably encodes portions of the bladder cancer proteins including peptides
derived from the bladder cancer protein. In one embodiment, a patient is immunized with a
DNA vaccine comprising a plurality of nucleotide sequences derived from a bladder cancer
gene. For example, bladder cancer-associated genes or sequences encoding subfragments of a
bladder cancer protein are introduced into expression vectors and tested for their
immunogenicity in the context of Class I MHC and an ability to generate cytotoxic T cell
responses. This procedure provides for production of cytotoxic T cell responses against cells
which present antigen, including intracellular epitopes.

In a preferred embodiment, the DNA vaccines include a gene encoding an adjuvant 
molecule with the DNA vaccine. Such adjuvant molecules include cytokines that increase 
the immunogenic response to the bladder cancer polypeptide encoded by the DNA vaccine.
Additional or alternative adjuvants are available.

In another preferred embodiment, bladder cancer genes find use in generating animal
models of bladder cancer. When the bladder cancer gene identified is repressed or
diminished in cancer tissue, gene therapy technology, e.g., antisense RNA directed
to the bladder cancer gene, will also diminish or repress expression of the gene. Animal
models of bladder cancer find use in screening for modulators of a bladder cancer-associated
sequence or modulators of bladder cancer. Similarly, transgenic animal technology including
gene knockout technology, e.g., as a result of homologous recombination with an appropriate
gene targeting vector, will result in the absence or increased expression of the bladder cancer
protein. When desired, tissue-specific expression or knockout of the bladder cancer protein
may be necessary.

It is also possible that the bladder cancer protein is overexpressed in bladder cancer. 
As such, transgenic animals can be generated that overexpress the bladder cancer protein. 
Depending on the desired expression level, promoters of various strengths can be employed 
to express the transgene. Also, the number of copies of the integrated transgene can be
determined and compared for a determination of the expression level of the transgene.
Animals generated by such methods find use as animal models of bladder cancer and are 
additionally useful in screening for modulators to treat bladder cancer. 

Kits for Use in Diagnostic and/or Prognostic Applications

For use in diagnostic, research, and therapeutic applications suggested above, kits are
also provided by the invention. In the diagnostic and research applications such kits may
include one or more of the following: assay reagents, buffers, bladder cancer-specific nucleic
acids or antibodies, hybridization probes and/or primers, antisense or inhibitory
polynucleotides, ribozymes, dominant negative bladder cancer polypeptides or
polynucleotides, small molecule inhibitors of bladder cancer-associated sequences, etc. A
therapeutic product may include sterile saline or another pharmaceutically acceptable
emulsion and suspension base.

In addition, the kits may include instructional materials containing directions (e.g.,
protocols) for the practice of the methods of this invention. While the instructional materials
typically comprise written or printed materials, they are not limited to such. A medium
capable of storing such instructions and communicating them to an end user is contemplated
by this invention. Such media include, but are not limited to, electronic storage media (e.g.,
magnetic discs, tapes, cartridges, chips), optical media (e.g., CD ROM), and the like. Such
media may include addresses to internet sites that provide such instructional materials.

The present invention also provides for kits for screening for modulators of bladder
cancer-associated sequences. Such kits can be prepared from readily available materials and
reagents. For example, such kits can comprise one or more of the following materials: a
bladder cancer-associated polypeptide or polynucleotide, control positive or negative
samples, reaction tubes, and instructions for testing bladder cancer-associated activity.
Optionally, the kit contains biologically active bladder cancer protein. A wide variety of kits
and components can be prepared according to the present invention, depending upon the
intended user of the kit and the particular needs of the user. Diagnosis would typically
involve evaluation of a plurality of genes or products. The genes will be selected based on
correlations with important parameters in disease which may be identified in historical or
outcome data.
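
The idea of selecting genes by their correlation with disease parameters in historical or outcome data can be illustrated with a small sketch. This is only an assumption-laden example and not the procedure of the application: the gene names, expression values, outcome labels, the choice of Pearson correlation, and the 0.8 cutoff are all hypothetical.

```python
from math import sqrt


def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length numeric sequences."""
    n = len(xs)
    if n != len(ys) or n < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        return 0.0  # a constant series carries no correlation information
    return cov / sqrt(var_x * var_y)


def select_genes(expression, outcome, min_abs_r=0.8):
    """Return (gene, r) pairs with |r| >= min_abs_r, strongest first.

    min_abs_r is an illustrative cutoff, not a value taken from the text.
    """
    scored = [(gene, pearson(values, outcome)) for gene, values in expression.items()]
    kept = [(gene, r) for gene, r in scored if abs(r) >= min_abs_r]
    return sorted(kept, key=lambda item: abs(item[1]), reverse=True)


# Hypothetical data: one expression value per patient for three made-up genes,
# and a made-up outcome parameter (1 = progressed, 0 = did not progress).
expression = {
    "geneA": [1.0, 2.0, 3.0, 4.0],
    "geneB": [4.0, 3.0, 2.0, 1.0],
    "geneC": [2.0, 2.0, 2.0, 2.0],
}
outcome = [0, 0, 1, 1]
selected = select_genes(expression, outcome)
```

In this toy data, geneA and geneB track the outcome (positively and negatively) and are kept, while the constant geneC is dropped. A real panel would need many more samples and some correction for testing many genes at once.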
EXAMPLES

Example 1: Gene Chip Analysis

Molecular profiles of various normal and cancerous tissues were determined and analyzed
using gene chips. RNA was isolated and gene chip analysis was performed as described
(Glynne, et al. (2000) Nature 403:672-676; Zhao, et al. (2000) Genes Dev. 14:981-993).



TABLE DESCRIPTIONS 
Table 1A shows about 3413 genes that exhibit increased or decreased expression in bladder cancer
samples. See USSN 60/302,814.

Table 2A shows about 485 genes overexpressed in bladder tumors relative to normal tissues 
as analyzed using the Affymetrix/Eos Hu03 GeneChip array. See USSN 60/343,705. 

Table 3A shows about 414 genes upregulated in bladder cancer relative to normal body
tissues and preferred for utility as small molecule, antibody, DNA vaccine targets for the
therapy of bladder cancer. These genes were selected from 59680 probesets on the
Eos/Affymetrix Hu03 Genechip array. Gene expression data for each probeset obtained from
this analysis was expressed as average intensity (AI), a normalized value reflecting the
relative level of mRNA expression.

Table 4A shows about 129 genes upregulated in bladder cancer relative to normal body
tissues and preferred for utility as diagnostics of bladder cancer. These genes were selected
from 59680 probesets on the Eos/Affymetrix Hu03 Genechip array. Gene expression data for
each probeset obtained from this analysis was expressed as average intensity (AI), a
normalized value reflecting the relative level of mRNA expression.

Table 5A shows about 149 genes upregulated in bladder cancer relative to normal body
tissues. These genes were selected from 59680 probesets on the Eos/Affymetrix Hu03
Genechip array. Gene expression data for each probeset obtained from this analysis was
expressed as average intensity (AI), a normalized value reflecting the relative level of mRNA
expression.

Table 6A shows about 199 genes upregulated in bladder cancer relative to normal bladder
tissue. These genes were selected from 59680 probesets on the Eos/Affymetrix Hu03
Genechip array. Gene expression data for each probeset obtained from this analysis was
expressed as average intensity (AI), a normalized value reflecting the relative level of mRNA
expression.

Table 7A shows about 63 genes downregulated in bladder tumors relative to normal bladder.
These genes were selected from 59680 probesets on the Eos/Affymetrix Hu03 Genechip
array. Gene expression data for each probeset obtained from this analysis was expressed as
average intensity (AI), a normalized value reflecting the relative level of mRNA expression.

Table 8A shows about 1440 genes upregulated in Ta or T1 bladder tumors from patients who
later presented with muscle-invasive bladder tumors (stage T2-T4). Gene expression data for
each probeset obtained from this analysis was expressed as average intensity (AI), a
normalized value reflecting the relative level of mRNA expression.

Table 9A shows about 1200 genes upregulated in Ta or T1 tumors of patients who later
presented with either more Ta tumors or no tumors at all. Gene expression data for each
probeset obtained from this analysis was expressed as average intensity (AI), a normalized
value reflecting the relative level of mRNA expression.

Table 10A shows about 65 genes upregulated in non-invasive exophytic Ta bladder tumors
relative to T2-T4 muscle-invasive tumors. Gene expression data for each probeset obtained
from this analysis was expressed as average intensity (AI), a normalized value reflecting the
relative level of mRNA expression.

Table 11A shows about 106 genes upregulated in muscle-invasive T2-T4 bladder tumors
relative to non-invasive exophytic Ta bladder tumors. Gene expression data for each
probeset obtained from this analysis was expressed as average intensity (AI), a normalized
value reflecting the relative level of mRNA expression.
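
As a rough illustration of how average intensity (AI) values might be compared between tumor and normal samples, the sketch below labels a probeset by the ratio of mean tumor AI to mean normal AI. It is a hypothetical example only: the function name, the 2-fold threshold, the intensity floor, and the sample values are assumptions, and the text does not state how the listed genes were actually scored.

```python
def mean(values):
    """Arithmetic mean of a non-empty sequence of numbers."""
    return sum(values) / len(values)


def classify_probeset(tumor_ai, normal_ai, fold=2.0, floor=1.0):
    """Label a probeset by the ratio of mean tumor AI to mean normal AI.

    `fold` and `floor` are illustrative assumptions, not values from the text.
    `floor` clamps very small means so the ratio never divides by ~0.
    """
    tumor_mean = max(mean(tumor_ai), floor)
    normal_mean = max(mean(normal_ai), floor)
    ratio = tumor_mean / normal_mean
    if ratio >= fold:
        return "upregulated", ratio
    if ratio <= 1.0 / fold:
        return "downregulated", ratio
    return "unchanged", ratio


# Hypothetical AI values for three probesets (tumor samples, normal samples).
up = classify_probeset([400, 600], [100, 150])
down = classify_probeset([20, 30], [200, 300])
flat = classify_probeset([100, 100], [90, 110])
```

The floor is a common guard against inflated ratios when a probeset is near background in one tissue; any real threshold would have to be chosen and validated against the actual array data.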

Table 12A shows the Pkey, ExAccn, UnigeneID, and Unigene Title for all of the sequences
in Table 13. Seq ID No. is used to link Table 12A to Table 13.

Tables 1B-12B show the accession numbers for those Pkey's lacking UnigeneID's for Tables
1A-12A, respectively. For each probeset is listed a gene cluster number from which
oligonucleotides were designed. Gene clusters were compiled using sequences derived from
Genbank ESTs and mRNAs. These sequences were clustered based on sequence similarity
using Clustering and Alignment Tools (DoubleTwist, Oakland California). Genbank
accession numbers for sequences comprising each cluster are listed in the "Accession"
column.
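
The general idea of grouping sequences into clusters by similarity can be sketched as below. This is not the DoubleTwist Clustering and Alignment Tools procedure, whose details are not given here; it is a toy single-linkage grouping that uses Python's `difflib.SequenceMatcher` as a stand-in similarity measure. The sequence names, the sequences themselves, and the 0.8 threshold are hypothetical.

```python
from difflib import SequenceMatcher
from itertools import combinations


def similarity(a, b):
    """Stand-in similarity score in [0, 1]; not a real alignment score."""
    return SequenceMatcher(None, a, b).ratio()


def cluster_sequences(sequences, threshold=0.8):
    """Single-linkage clustering: join any two sequences scoring >= threshold."""
    parent = {name: name for name in sequences}

    def find(name):
        # Follow parent links to the cluster representative, compressing the path.
        while parent[name] != name:
            parent[name] = parent[parent[name]]
            name = parent[name]
        return name

    for a, b in combinations(sequences, 2):
        if similarity(sequences[a], sequences[b]) >= threshold:
            parent[find(a)] = find(b)

    clusters = {}
    for name in sequences:
        clusters.setdefault(find(name), []).append(name)
    return sorted(sorted(members) for members in clusters.values())


# Hypothetical short sequences standing in for ESTs and mRNAs.
ests = {
    "est1": "ACGTACGTAC",
    "est2": "ACGTACGTAA",
    "est3": "TTTTGGGGCC",
}
clusters = cluster_sequences(ests)
```

Here est1 and est2 differ in one position and fall into one cluster, while est3 stays alone. Production clustering of ESTs relies on proper alignment, orientation handling, and quality filtering, none of which this sketch attempts.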

Tables 1C-12C show genomic positioning for Pkey's lacking Unigene ID's and accession
numbers for Tables 1A-12A, respectively. For each predicted exon, the genomic
sequence source used for prediction is listed. Nucleotide locations of each predicted exon are also
listed.
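
The relationship among the A, B, and C tables described above (UnigeneID where available, otherwise cluster accessions, otherwise predicted exon positions) can be pictured with a small lookup sketch. The record type, function name, and the contents of the two lookup dictionaries are hypothetical; only the first example row (Pkey 400440) is taken from Table 1A.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class ProbesetRow:
    """One row of an A-table; empty cells are represented as None."""
    pkey: int
    ex_accn: Optional[str]
    unigene_id: Optional[str]
    unigene_title: Optional[str]


def describe(row, accessions_by_pkey, exons_by_pkey):
    """Pick the best available annotation, following the A -> B -> C table order."""
    if row.unigene_id:
        return f"{row.pkey}: UniGene {row.unigene_id}"
    if row.pkey in accessions_by_pkey:
        return f"{row.pkey}: accessions {', '.join(accessions_by_pkey[row.pkey])}"
    if row.pkey in exons_by_pkey:
        source, start, end = exons_by_pkey[row.pkey]
        return f"{row.pkey}: predicted exon {source}:{start}-{end}"
    return f"{row.pkey}: no annotation"


# First row is from Table 1A; the B- and C-table entries below are made up.
annotated = ProbesetRow(400440, "X83957", "Hs.83870", "nebulin")
unannotated = ProbesetRow(400888, None, None, None)
b_table = {400888: ["HYPOTHETICAL_ACC_1", "HYPOTHETICAL_ACC_2"]}
c_table = {401566: ("hypothetical_genomic_source", 100, 250)}

first = describe(annotated, b_table, c_table)
second = describe(unannotated, b_table, c_table)
third = describe(ProbesetRow(401566, None, None, None), b_table, c_table)
```

Keying every lookup on Pkey mirrors how the tables are cross-referenced in the text; the fallback order is the only logic the sketch adds.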

TABLE 1A:
Pkey: Unique Eos probeset identifier number
ExAccn: Exemplar Accession number, Genbank accession number
UnigeneID: Unigene number
Unigene Title: Unigene gene title
R1: please refer to original application
R2: please refer to original application
Target Type: downregulate stage if target is downregulated in bladder tumors relative to normal bladder, or
early stage if target is an early stage (Ta) bladder tumor marker, or
late stage if target is a late stage (T2-T4) bladder tumor marker, or
T2-T4 grade 3 papilloma marker, or T2-T4 grade 3 solid tumor marker, or upregulate stage
Pkey    ExAccn     UnigeneID  Unigene Title                             R1    R2     Target Type
400440  X83957     Hs.83870   nebulin                                   0.17  2.05   downregulate stage
400888                                                                  0.24  1.97   downregulate stage
401566                                                                  0.19  4.06   downregulate stage
401669                                                                  0.2   2.05   downregulate stage
401691                                                                  0.04  10.13  downregulate stage
401905                                                                  0.3   1.87   downregulate stage
402076                                                                  0.06  6.51   downregulate stage
402110                                                                  0.43  2.35   downregulate stage
402271  NM_002197  Hs.154721  aconitase 1, soluble                      0.21  216    downregulate stage
403362                                                                  0.18  4.44   downregulate stage
403687                                                                  0.32  1.91   downregulate stage
403959                                                                  0.14  2.27   downregulate stage
404015                                                                  0.2   2.48   downregulate stage
404059                                                                  0.36  1.84   downregulate stage
404152                                                                  0.32  1.85   downregulate stage
404498                                                                  0.14  2.18   downregulate stage
404819                                                                  0.19  5.25   downregulate stage
405001  U58196     Hs.296281  interleukin enhancer binding factor 1     0.16  2.92   downregulate stage
405349                                                                  0.18  3.8    downregulate stage
405390                                                                  0.3   254    downregulate stage
405735                                                                  0.13  2.44   downregulate stage
405968                                                                  0.26  1.85   downregulate stage
406017                                                                  0.32  228    downregulate stage
406305  BE261320   Hs.297096  transcriptional adaptor 3 (ADA3, yeast h  0.42  1.93   downregulate stage
406320                                                                  0.37  201    downregulate stage
406704  M21665     Hs.929     myosin, heavy polypeptide 7, cardiac mus  0.3   2.84   downregulate stage
406706  X03740     Hs.231581  myosin, heavy polypeptide 1, skeletal mu  0.14  7.4    downregulate stage
406707  S73840     Hs.931     myosin, heavy polypeptide 2, skeletal mu  0.05  12.51  downregulate stage
407013  U35637                gb:Human nebulin mRNA, partial cds        0.14  2.17   downregulate stage
407245  X90568     Hs.172004  titin                                     0.02  15.21  downregulate stage
407330  AA582607              gb:nn51b05.s1 NCI_CGAP_Kid6 Homo sapiens  0.42  1.87   downregulate stage
407571  AI446183   Hs.9572    ESTs                                      0.38  2.13   downregulate stage
407682  AL035858   Hs.3807    FXYD domain-containing ion transport reg  0.34  2.56   downregulate stage
407815  AW373860   Hs.301716  ESTs                                      0.31  244    downregulate stage
407834  AW084991   Hs.26100   ESTs                                      0.15  2.98   downregulate stage
407891  AA486620   Hs.41135   endomucin-2                               0.15  3.33   downregulate stage
407906  AA369665   Hs.41185   Homo sapiens mRNA; cDNA DKFZp564O1262 (f  0.12  8.05   downregulate stage
407938  AA905097   Hs.85050   phospholamban                             0.08  8.77   downregulate stage
407965  W21483     Hs.41707   heat shock 27kD protein 3                 0.26  2.29   downregulate stage
408009  AF020498   Hs.41735   purinergic receptor P2X, ligand-gated io  0.49  1.91   downregulate stage
408139  AA451966   Hs.43005   RAB9-like protein                         0.41  1.88   downregulate stage
408221  AA912183   Hs.47447   ESTs                                      0.04  24.1   downregulate stage
408374  AW025430   Hs.155591  forkhead box F1                           0.35  2.85   downregulate stage
408493  BE206854   Hs.46039   phosphoglycerate mutase 2 (muscle)        0.09  9.04   downregulate stage
408508  AI806109   Hs.135736  KIAA1580 protein                          0.45  2.2    downregulate stage
408614  AL137698   Hs.46531   Homo sapiens mRNA; cDNA DKFZp434C1915 (f  0.13  5.48   downregulate stage
408652  R43409     Hs.6829    ESTs                                      0.33  2.5    downregulate stage
408753  AI337192   Hs.47438   SH3 domain binding glutamic acid-rich pr  0.05  6.94   downregulate stage
408896  AI610447   Hs.48778   niban protein                             0.39  2.01   downregulate stage
409277  T05558     Hs.156880  ESTs                                      0.18  2.75   downregulate stage
410023  AB017169   Hs.57929   slit (Drosophila) homolog 3               0.07  2.7    downregulate stage
410036  R57171     Hs.57975   calsequestrin 2, cardiac muscle           0.09  5.23   downregulate stage
410132  NM_003480  Hs.58882   Microfibril-associated glycoprotein-2     0.24  234    downregulate stage
410168  AW834050   Hs.9973    tensin                                    0.39  2.17   downregulate stage
410243  083402     Hs.289006  ESTs, Weakly similar to alternatively sp  0.11  2.82   downregulate stage
410339  AI916499   Hs.298258  ESTs                                      0.15  2.16   downregulate stage
410677  NM_003278  Hs.65424   tetranectin (plasminogen-binding protein  0.32  265    downregulate stage
410858  T06529     Hs.98516   Homo sapiens cDNA FLJ11490 fis, clone HE  0.2   274    downregulate stage
411048  AK001742   Hs.67991   hypothetical protein DKFZp434G0522        0.2   1.92   downregulate stage
411067  AI681006   Hs.301543  ESTs                                      0.11  3.41   downregulate stage
411069  AL133092   Hs.68055   hypothetical protein DKFZp434I0428        0.17  5.8    downregulate stage
411644  H92064     Hs.301548  ESTs                                      0.06  13.8   downregulate stage
411741  AW859650              gb:RCO.CT0358471299411-d03 CT0358 Homo    0.36  25     downregulate stage
412047  AA934589   Hs.49896   ESTs                                      0.18  3.57   downregulate stage
412095  AI624707   Hs.5921    Homo sapiens cDNA: FLJ21592 fis, clone C  0.32  1.89   downregulate stage
412389  AW947655              gb:RCO-MT0003.140300.031-b07 MT0003 Homo  0.38  26     downregulate stage
412442  AI983730   Hs.26530   serum deprivation response (phosphatidyl  0.12  3.67   downregulate stage
412519  AA196241   Hs.73980   troponin T1, skeletal, slow               0.24  1.86   downregulate stage
412622  AW664708   Hs.171959  ESTs                                      0.06  5.45   downregulate stage
412649  NM_002206  Hs.74369   integrin, alpha 7                         0.29  295    downregulate stage
412659  AW753865   Hs.74376   olfactomedin related ER localized protei  0.18  206    downregulate stage
412758  Y07818     Hs.74566   dihydropyrimidinase-like 3                0.3   223    downregulate stage
412802  U41518     Hs.74602   aquaporin 1 (channel-forming integral pr  0.11  2.71   downregulate stage
412975  T70956     Hs.75106   clusterin (complement lysis inhibitor, S  0.44  2.03   downregulate stage
413074  AB71358    Hs.8417    ESTs                                      0.47  1.91   downregulate stage
413272  AA127923   Hs.293256  ESTs                                      0.09  4.44   downregulate stage
413276  Z24725     Hs.75260   mitogen inducible 2                       0.23  248    downregulate stage
413508  BE145364              gb:ll0-HT0198-151099*12S«05 HT0198 Homo   0.31  253    downregulate stage
413624  BE177019   Hs.75445   SPARC-like 1 (mast9, hevin)               0.33  2.17   downregulate stage
413778  AA090235   Hs.75535   myosin, light polypeptide 2, regulatory,  0.33  263    downregulate stage
414053  H26904     Hs.75736   apolipoprotein D                          0.42  1.85   downregulate stage
414241  AA425085   Hs.4007    Sarcolemmal-associated protein            0.16  222    downregulate stage
414290  AI568801   Hs.71721   ESTs                                      ao2   10     downregulate stage
414629  AA345824   Hs.76688   carboxylesterase 1 (monocyte/macrophage   0.13  4.14   downregulate stage
414657  AA424074   Hs.76780   protein phosphatase 1, regulatory (inhib  0.33  214    downregulate stage
414712  N88858     Hs.77039   ribosomal protein S3A                     0.4   25     downregulate stage
414903  AA451700   Hs.85835   Homo sapiens cDNA: FLJ22841 fis, clone K  0.3   3.3    downregulate stage
415165  AW887504   Hs.78065   complement component 7                    0.04  3.41   downregulate stage
415274  AF001548   Hs.78344   myosin, heavy polypeptide 11, smooth mus  0.2   3.29   downregulate stage
415447  Z97171     Hs.78454   myocilin, trabecular meshwork inducible   0.15  6.55   downregulate stage
415672  N53097     Hs.193579  ESTs                                      0.28  3.55   downregulate stage
415934  NM_000928  Hs.992     phospholipase A2, group IB (pancreas)     0.34  264    downregulate stage
416127  N49843     Hs.79022   GTP-binding protein overexpressed in ske  0.3   1.98   downregulate stage
416349  X69089     Hs.79227   myomesin (M-protein) 2 (165kD)            0.41  1.96   downregulate stage
416585  X54162     Hs.79386   leiomodin 1 (smooth muscle)               0.02  49.3   downregulate stage
416854  H40164     Hs.80296   Purkinje cell protein 4                   0.02  7.55   downregulate stage
416941  BE000150   Hs.48778   niban protein                             0.27  216    downregulate stage
416982  J05401     Hs.80691   creatine kinase, mitochondrial 2 (sarcom  0.29  243    downregulate stage
417011  F08212     Hs.234898  ESTs                                      0.41  2.06   downregulate stage
417298  AW665639   Hs.37958   ESTs                                      0.27  3.7    downregulate stage
417501  AL041219   Hs.82222   sema domain, immunoglobulin domain (Ig),  0.39  208    downregulate stage
417553  L09190     Hs.82276   trichohyalin                              0.29  259    downregulate stage
417987  AA210872   Hs.50133   ESTs                                      0.22  2.09   downregulate stage
418297  R91254                gb:yp94e12.s1 Soares fetal liver spleen   0.28  1.9    downregulate stage
418332  R34976     Hs.78293   ESTs                                      0.2   3.9    downregulate stage
418391  NM_003281  Hs.84673   troponin I, skeletal, slow                0.35  202    downregulate stage
418409  AA219332   Hs.120869  ESTs, Weakly similar to AF092922 1 reiin  0.21  3.8    downregulate stage
418421  R58620     Hs.85050   phospholamban                             0.2   2.08   downregulate stage
418489  U76421     Hs.85302   adenosine deaminase, RNA-specific, B1 (h  0.05  21.55  downregulate stage
418533  NM_004533  Hs.85937   myosin-binding protein C, fast-type       0.42  1.95   downregulate stage
418787  AW296134   Hs.86999   ESTs                                      0.48  1.87   downregulate stage
418793  AW382987   Hs.88474   prostaglandin-endoperoxide synthase 1 (p  0.26  243    downregulate stage
418947  W52990     Hs.22860   ESTs                                      0.13  7.18   downregulate stage
419037  R39895     Hs.7864    ESTs                                      0.27  2      downregulate stage
419441  AW023731   Hs.274368  Homo sapiens mRNA; cDNA DKFZp586I1524 (f  0.19  5.25   downregulate stage
419535  AW139550   Hs.115173  ESTs                                      0.31  259    downregulate stage
419682  H13139     Hs.92282   paired-like homeodomain transcription fa  0.28  238    downregulate stage
419685  W76083     Hs.173077  ESTs                                      0.4   221    downregulate stage
419703  AI793257   Hs.128151  ESTs                                      0.09  3.52   downregulate stage
419942  U25138     Hs.93841   potassium large conductance calcium-acti  0.28  296    downregulate stage
420058  AK001423   Hs.94694   Homo sapiens cDNA FLJ10561 fis, clone NT  0.3   209    downregulate stage
420195  N44348     Hs.300794  ESTs                                      0.22  279    downregulate stage
420261  AW206093   Hs.748     fibroblast growth factor receptor 1 (fms  0.35  1.98   downregulate stage
420674  NM_000055  Hs.1327    butyrylcholinesterase                     0.29  3.5    downregulate stage
421296  NM_002666  Hs.103253  perilipin                                 0.36  211    downregulate stage
421639  NM_012082  Hs.297921  Homo sapiens mRNA full length insert cDN  0.13  4.3    downregulate stage
421763  AW163500   Hs.108080  cysteine and glycine-rich protein 1       0.26  3.49   downregulate stage
421853  AL117472   Hs.108924  DKFZP586P1422 protein                     0.14  5      downregulate stage
422103  AA984330   Hs.111676  protein kinase H11; small stress protein  0.2   229    downregulate stage
422287  F16365     Hs.114346  cytochrome c oxidase subunit VIIa polype  0.27  258    downregulate stage
422320  AI745249   Hs.23650   ESTs, Weakly similar to AAB47496 NG5 [H.  0.24  295    downregulate stage
422633  X56832     Hs.118804  enolase 3, (beta, muscle)                 0.23  3.57   downregulate stage
422639  AI929377   Hs.173724  creatine kinase, brain                    0.39  1.97   downregulate stage
423334  AK000906   Hs.127273  hypothetical protein FLJ10044             0.37  229    downregulate stage
423587  AA328074   Hs.284256  hypothetical protein FLJ14033 similar to  0.37  2.47   downregulate stage
423889  AL035447   Hs.134594  hypothetical protein LOC57158             0.24  243    downregulate stage
424181  AL039482   Hs.142517  Homo sapiens mRNA; cDNA DKFZp434P0810 (f  0.27  228    downregulate stage
424206  NM_003734  Hs.198241  amine oxidase, copper containing 3 (vasc  0.3   2.59   downregulate stage
424479  AF064238   Hs.149098  smoothelin                                0.26  3.29   downregulate stage
424580  AA446539   Hs.35092   ESTs                                      0.15  257    downregulate stage
424846  AU077324   Hs.1832    neuropeptide Y                            0.4   204    downregulate stage
424938  AW102607   Hs.245233  ESTs                                      0.29  216    downregulate stage
424982  U94777     Hs.154084  phosphorylase, glycogen; muscle (McArdle  0.42  1.89   downregulate stage
425383  D83407     Hs.156007  Down syndrome critical region gene 1-lik  0.14  1.86   downregulate stage
425545  N98529     Hs.158295  Human mRNA for myosin light chain 3 (MLC  0.03  13.25  downregulate stage
425622  AW360847   Hs.16578   ESTs                                      0.3   219    downregulate stage
425751  T19239     Hs.1940    crystallin, alpha B                       0.47  1.92   downregulate stage
425869  AA524547   Hs.160318  FXYD domain-containing ion transport reg  0.45  1.85   downregulate stage
425887  AL049443   Hs.161283  Homo sapiens mRNA; cDNA DKFZp586N2020 (f  0.19  285    downregulate stage
425932  M81650     Hs.1968    semenogelin 1                             0.02  16.3   downregulate stage
426354  NM_004010  Hs.169470  dystrophin (muscular dystrophy, Duchenne  0.27  2.52   downregulate stage
425429  X73114     Hs.169849  myosin-binding protein C, slow-type       0.1   10.3   downregulate stage
426752  X69490     Hs.172004  titin                                     0.03  31.3   downregulate stage
426809  BE313114   Hs.29706   ESTs                                      0.34  2.95   downregulate stage
427078  AI676062   Hs.111902  ESTs                                      0.22  2.11   downregulate stage
427136  AL117415   Hs.173716  Homo sapiens mRNA; cDNA DKFZp434K0521 (f  0.37  2.33   downregulate stage
427164  AB037721   Hs.173871  KIAA1300 protein                          0.12  5.47   downregulate stage
427185  AA398930   Hs.186674  ESTs                                      0.22  4.65   downregulate stage
427373  AB007972   Hs.177533  Homo sapiens mRNA; cDNA DKFZp586N0318 (f  0.22  3.18   downregulate stage
427393  AB029018   Hs.177635  KIAA1095 protein                          0.27  2.13   downregulate stage
427665  AF134803   Hs.180141  cofilin 2 (muscle)                        0.05  4
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gb:EST1 0534 Adipose tissue, white I Homo 
Hs. 180946 rtexnaJ protein L5 

gb:RC4OT0071-2403(KM)13-b04 OT0071 Homo 
Hs.97056 hypothetical protein FU21634 
Hs.122139 . ESTs 
Hs.13333 ESTs 
Hs.30164 ESTs 

Hs.57904 mago-nashi (Drosophila) homotog, profife 
Hs.58169 highly expressed in cancer, rich in leuc 

gb:RC1-STQ27W)80100-01 1-h04 ST0278 Homo 
Hs.279727 Homo sapiens cDNA FU14Q35 fe, clone HE 
Hs.271280 ESTs 

Hs.58650 hypothetical protein FLJ10724

Hs.15830 Homo sapiens cDNA FLJ12691 fis, clone NT

Hs.261285 pleiotropic regulator 1 (PRL1, Arabidops

Hs.59838 hypothetical protein FLJ10808

Hs.61418 microfibrillar-associated protein 1

Hs.81485 hypothetical protein

Hs.68301 ESTs 

Hs.282230 ESTs 

Hs.62264 KIAA0937 protein 

Hs.62643 dual adaptor of phosphotyrosine and 3-ph 

Hs.22039 hepatocyte nuclear factor 3, alpha 

Hs.63236 synuclein, gamma (breast cancer-specific 

Hs.172405 ESTs, Moderately similar to I52835 H-NUC

Hs.63657 hypothetical protein FLJ11005

Hs.63788 propionyl Coenzyme A carboxylase, beta p

gb:QV0-BT0537-231299-O49-f03 BT0537 Homo 
gb:zb80d09.s1 Soares.senescent fibroblas 
Hs.83286 ESTs 

Hs.188662 Homo sapiens cDNA: FLJ23421 fis, clone H

gb:!L3-CT0219-281099-024-A03 CT0219 Homo 
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Hs.68784 ESTs 
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Hs.266859 ESTs 

Hs.65370 lipase, endothelial 

Hs.65403 hypothetical protein 

Hs.293950 ESTs 

gb:EST66726 Fetal lung III Homo sapiens 
gb:yc16b10.s1 Stratagene lung (937210) H 

Hs.66170 HSKM-B protein 

Hs.250164 ESTs, Weakly similar to coded for by C. 

Hs.288836 Homo sapiens cDNA FLJ12673 fis, clone NT

Hs.154669 ESTs 

Hs.66521 Machado-Joseph disease (spinocerebellar 

gb:MR4-ST0062-031199-018-d06 ST0062 Homo 
Hs.66718 RAD54 (S.cerevisiae)-like 
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gb:RC0-ST01 74-21 1 099-01 1 -hi 2 ST0174 Homo 
Hs.25092 ESTs 

gb:RC3-ST0186-3O0100-O17-e04 ST0186 Homo 
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Hs.297007 ESTs 

gb:yv07f01.s1 Soares fetal liver spleen 
gb:CM1-ST0283-071299"061-d08 ST0283 Homo 
gb:RC5-ST0293-021299-031-A04 ST0293 Homo 
gb:QV0-STQ29W030(M51-f02 ST0294 Homo 
gb.-QVO-ST0294.100400-185^07 ST0294 Homo 
gb.-QV2-ST0298-140200-042-b05 ST0298 Homo 
gb:tL2-ST031 1-21 1 299-028-F1 2 ST031 1 Homo 
gb:QV4-HT0222-181099-013-g03 HT0222 Homo 
gb:QV4-n0008-27109W)2O^01 TT0008 Homo 
kinesin-like 6 (mitotic centromere-assoc
gb:QVO-8N0040.170300-161-d07 BN0O4O Homo 
Hs.135191 ESTs, Weakly similar to unnamed protein
Hs.42680 ESTs 

gb:QVMT0036-15020(W)74-h06 LT0036 Homo 
Hs.116122 ESTs, Weakly similar to unnamed protein

gb:RC3*T0470-120200-0134>10 HT0470 Homo 
Hs.69749 KIAA0087 gene product 
Hs.130815 hypothetical protein FLJ21870
Hs.69851 GAR1 protein
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1 


1 


upregulate stage 


ESTs 
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0.59 
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Fanconi anemia, complementation group A 


4.75 


ai3 


upregulate stage 


cleavage and polyadenylation specific fa 


1 


0.23 


upregulate stage 


ESTs 


7.95 


0.09 


upregulate stage 


Human mRNA for KIAA0075 gene, partial cd


3 


0.15 


upregulate stage 


Homo sapiens cDNA FLJ10238 fis, clone HE


17.2 


0.04 


upregulate stage 


bone morphogenetic protein receptor, typ


1 


1 


upregulate stage 


annexin A8


Z11 


0.3 


upregulate stage 


ESTs 


6.75 


0.07 


upregulate stage 


thrombospondin 1


4.5 


0.08 


upregulate stage 


ESTs 


1 


0.33 
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gb:HSC0BF041 normalized infant brain cDN 


1 


0.91 


upregulate stage 


ESTs 


4.7


0.12 


upregulate stage 


ESTs, Weakly similar to Weak similarity 


5.85 


0.1 


upregulate stage 


ESTs 


3.6 


0.09 
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upregulate stage 


hypothetical protein FLJ12476


1 


1 


Homo sapiens cDNA FLJ13150 fis, clone NT


9.15 


0.09 


upregulate stage 


arachidonate 12-lipoxygenase 


9.25 


0.08 
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msh {DrosophilaJ homeo box homolog 2 


1.83 


0.43 
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ESTs 


11.3 


0.06 
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ESTs 
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1 
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ESTs 


4.75 


0.12 


lamin B1


2.6 


0.13 


upregulate stage 


arachidonate 5-lipoxygenase 


1.45 


0.53 


upregulate stage 


ESTs 
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0.23 
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gbyd84a05.r1 Soares fetal forer spleen 


1 


0.38 


upregulate stage 


insulinoma-associated 1


1.25 


0.18 


upregulate stage 


parathyroid hormone-like hormone


1 


1 
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upregulate stage 


ESTs, Weakly similar to ORF2 function u


1.59 


0.3 


cytochrome P450, subfamily XXIV (vitamin


3.55 


0.05 
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ESTs, Weakly similar to ALU1 .HUMAN ALU S 


1 


0.27 
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ESTs 


1 


0.37 
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ESTs 
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0.07 
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ESTs 


1 


1 


progesterone binding protein 


10.6 


0.06 


upregulate stage 
upregulate stage 


chromosome segregation 1 (yeast homolog)


1.84 


0.47 


ESTs 


1 


1 


upregulate stage 


ESTs, Highly similar to AF161358_1 HSPC0


6.6* 


0.1 


upregulate stage 


PTK7 protein tyrosine kinase 7 


1.31 


0.64 


upregulate stage 


gb:EST390147 MAGE resequences, MAGO Homo 


1 


1 
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ESTs 


1.1 


0.24
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419477 


AA826279 




gb:od03g07.s1 NCI_CGAP_GCB1 Homo sapiens


1 


0.56 


419484 


AA243474 


Hs.272128 


Homo sapiens cDNA FLJ13901 fis, clone TH


1 


0.22 


419506 


N20912 


Hs.42369 


ESTs 


1 


1 


419554 


AI732138 


Hs.104318 


ESTs 


1 


0.5 


419569 


AI971651 


Hs.91143 


jagged 1 (Alagille syndrome)


1 


0.91 


419594 


AA013051 


Hs.91417 


topoisomerase (DNA) II binding protein 


8.1 


0.08 


419651 


NM_007023


Hs.91971 


cAMP-regulated guanine nucleotide exchan 


1 


1


419666 


NM_014810


Hs.92200 


KIAA0480 gene product


5.2 


0.12 


419737 


H24185 


Hs.92918 


hypothetical protein 


11.7 


0.07 


419743 


AW408762 


Hs.127478 


ESTs 


6.1 


0.09 


419752 


AA249573 


Hs.152618 


ESTs 


1.8 


0.17 


419769 


H27374 


Hs.103483 


ESTs 


1 


0.36 


419805 


AW966945 




gb:EST379019 MAGE resequences, MAGJ Homo


1 


0.34 


419807 


R77402 




gb:yi75f11.s1 Soares placenta Nb2HP Homo


1 


0.67 


419831 


AW448930 


Hs.5415 


ESTs 


7.05 


0.1 


419833 


AA251131 


Hs.220697 


ESTs 


1.25 


0.53 


419834 


AA251139 




gb:zs03g12.s1 NCI_CGAP_GCB1 Homo sapiens


1 


1 


419923 


AW081455 


Hs.120219 


ESTs 


5.89 


0.13 


419945 


AW290975 


Hs.118923 


ESTs 


1 


0.24 


419962 


AA830111 


Hs.291917 


ESTs 


1 


1 


419970 


AW612022 


Hs.263271 


ESTs 


9.15 


0.09 


419986 


AI345455 


Hs.78915 


GA-binding protein transcription factor, 


3.05 


0.17 


419998 


AA252691 




gb:zs26d09.r1 NCI_CGAP_GCB1 Homo sapiens


1 


0.47 


420016 


AW016908 


Hs.88025 


ESTs 


1 


0.8 


420047 


AI478658 


Hs.94631 


brefeldin A-inhibited guanine nucleotide


4.8 


0.11 


420076 


AA827860 


Hs.293717 


ESTs 


5.35 


0.12 


420111 


AA255652 




gb:zs21h11.r1 NCI_CGAP_GCB1 Homo sapiens


5.3 


0.11 


420145 


AA809860 


Hs.256264 


ESTs 


1 


1 


420159 


AI572490


Hs.99785 


Homo sapiens cDNA: FLJ21245 fis, clone C


14.8 


0.04 


420161 


AI683069 


Hs.175319 


ESTs 


4.7 


0.11 


420184 


AA188408 


Hs.95665 


hypothetical protein 


4.35 


0.15 


420226 


AA773709 


Hs.152818


ubiquitin specific protease 8 


3.1 


0.16 


420230 


AL034344 


Hs.298020 


Homo sapiens cDNA FLJ11796 fis, clone HE


10.35 


0.06 


420236 


AA256763 


Hs.291111 


ESTs 


4.45 


0.14 


420270 


AA257990 




gb:zs35h07.r1 NCI_CGAP_GCB1 Homo sapiens


10.05 


0.08 


420297 


AI628272 


Hs.88323 


ESTs 


9.45 


0.09 


420344 


BE463721 


Hs.97101 


putative G protein-coupled receptor 


11.7 


0.05 


420392 


AI242930


Hs.97393 


KIAA0328 protein 


17 


0.22 


420413 


AW971624 


Hs.120605 


ESTs 


1 


1 


420445 


AA262213 


Hs.193514 


ESTs 


1 


1 


420471 


AA262452 


Hs.192268 


ESTs 


3.95 


0.13 


420479 


AW183695 


Hs.186572 


ESTs 


4.95 


0.12 


420493 


AI635113


Hs.270366 


Homo sapiens mRNA; cDNA DKFZp564H0616 (f 


4.4 


0.15 


420552 


AK000492 


Hs.98806 


hypothetical protein 


11.55 


0.06 


420572 


AL035593 


Hs.99016 


Human DNA sequence from clone 310J6 on c 


1.35 


0.22 


420643 


W87731 




gb:zh65g10.r1 Soares_fetal_liver_spleen_


1.25 


0.25 


420650 


AA455706 


Hs.44581 


heat shock protein hsp70-related protein 


7.3 


0.09 


420654 


AA279091 


Hs.104420 


ESTs 


1 


0.27 


420655 


R74405 


Hs.300886 


ESTs 


1 


1 


420717 


AA284447


Hs.271887 


ESTs 


9 


0.09 


420734 


AW972872 


Hs.293736


ESTs 


5.2 


0.13 


420756 


AA411800 


Hs.189900 


ESTs 


1 


1 


420789 


AI670057 


Hs.199882


ESTs 


8.85 


0.05 


420802 


U22376 


Hs.1334 


v-myb avian myeloblastosis viral oncogen 


4.6 


0.12 


420851 


AA281062 


Hs.250734 


ESTs 


6.35 


0.08 


420880 


AI609621 


Hs.105620 


ESTs 


1 


1 


420923 


AF097021 


Hs.273321 


differentially expressed in hematopoieti


10.4 


0.03 


420928 


AA281809 




gb:zt10e01.r1 NCI_CGAP_GCB1 Homo sapiens 


1 


1 


420936 


AM56112 


Hs.99410 


ESTs 


8.71 


0.07 


420947 


AA491044 


Hs.47196 


ESTs 


1 


0.38 


421017 


AW979181 


Hs.293221 


ESTs, Weakly similar to ALU1_HUMAN ALU S


1 


1 


421064 


AI245432 


Hs.101382 


tumor necrosis factor, alpha-induced pro 


1.26 


0.62 


421070 


AA283185 


Hs.19327 


ESTs 


2.2 


0.14 


421100 


AW351839 


Hs.124660 


Homo sapiens cDNA: FLJ21763 fis, clone C


1.69 


0.28 


421102 


AI470093 


Hs.89217 


ESTs 


Z65 


0.19 


421103 


AI625835 


Hs.27104 


ESTs 


6 


0.1 


421114 


AW975051 


Hs.293156 


ESTs 


4.7 


0.12 


421118 


AI471925 


Hs.89257 


ESTs 


1 


0.39 


421155 


H87879 


Hs.102267 


lysyl oxidase


1.15 


0.18 


421159 


AW978316 


Hs.136649 


ESTs 


1 


0.44 


421187 


NM_014721


Hs.102471 


KIAA0680 gene product


5.7 


0.11 


421218 


NM_000499


Hs.72912 


cytochrome P450, subfamily I (aromatic c 


0.07 


2.55 


421221 


AW276914 


Hs.300877 


ESTs 


8.75 


0.07 


421229 


AI056590 


Hs.7086 


Homo sapiens cDNA: FLJ23000 fis, clone L


1.64 


0.49 


421261 


AA600853 


Hs.98133 


ESTs 


10.9 


0.07 


421262 


AA286746 


Hs.9343 


Homo sapiens cDNA FLJ14265 fis, clone PL


2.65 


0.16 


421278 


AI367919 


Hs.99691 


ESTs 


1 


0.56 
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gb:ob39a05*1 NCI_CGAP_GCB1 Homo sapiens 


1 


0.34 


upregulate stage 


421282 


AA286914 


Hs.183299


ESTs 


9 


0.03 


upregulate stage 
upregulate stage 


421306 


AA806207 


Hs.125889


ESTs 


1 


0.95 


421308 


AA687322 


Hs.192843


ESTs 


2.85 


0.15 


upregulate stage 


421373 


AA808229 


Hs.167771 


ESTs 


145 


0.14 


upregulate stage 


421379 


Y15221 


Hs.103982


small inducible cytokine subfamily B (Cy 


1.6 


0.% 


upregulate stage 


421381 


AA361752 




gb:EST71314 T-cell lymphoma Homo sapiens


5.05 


0.09 


upregulate stage 


421418 


AA806639 




gb:ob88g05.s1 NCI_CGAP_GCB1 Homo sapiens


6.55 


0.1 


upregulate stage 


421433 


AI829192


Hs.134805


ESTs 


9.9 


0.07 


upregulate stage 


421451 


AA291377 


Hs.50831 


ESTs 


11.9 


0.06 
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upregulate stage 


421491 


H99999 


Hs.42736 


ESTs 


3 


0.2 


421493 


BE300341 


Hs.104925 


ectodermal-neural cortex (with BTB-like


2.51 


0.32 


upregulate stage 


421559 


NM_014720 


Hs.105751


Ste20-related serine/threonine kinase 


9 


0.09 


upregulate stage 


421577 


BE465451 


Hs.105925 


single-minded (Drosophila) homolog 1


5.75 


0.12 


upregulate stage 


421673 


H54384 


Hs.36892 


ESTs 


1 


1 
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421685 


AF189723


Hs.106778


ATPase, Ca++ transporting, type 2C, memb 


9.45 


0.07 
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421708 


AW754341 




gb:Cfi«)-CTO341-181299-13O-h12CT0341 Homo 


1 


0.47 


421733 


AL119671


Hs.1420


fibroblast growth factor receptor 3 (ach 


1.97 


0.33 


upregulate stage 


421838 


AW881089 


Hs.108806 


Homo sapiens mRNA; cDNA DKFZp566M0947 (f 


7.05 


0.1 


upregulate stage 


421869 


AB003592 


Hs.109050 


contactin 6


1 


1 


upregulate stage 


421925 


S80310 


Hs.109620 


acidic epididymal glycoprotein-like 1


1 


1 
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421948 


L42583 


Hs.111758


keratin 6A 


51.9 


0.01 
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421958 


AA357185 


Hs.109918 


ras homolog gene family, member H


10.17 


0.07 


421991 


NM_014918 


Hs.110488


KIAA0990 protein


4.5 


0.17 
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422026 


U80736 


Hs.110826


trinucleotide repeat containing 9


6.5 


0.08 
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422072 


AB018255 


Hs.111138


KIAA0712 gene product


9.2 


0.08 
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422094 


AF129535 


Hs.272027 


F-box only protein 5 


6.95 
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L10343 


Hs.112341


protease inhibitor 3, skin-derived (SKAL 
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0.17 
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AA586894 


Hs.112408


S100 calcium-binding protein A7 (psorias


3.96 


0.1 
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AL043892 


Hs.180582 


Homo sapiens cDNA: FLJ21836 fis, clone H
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gb;EST44247 Fetal brain I Homo sapiens c 
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AA307595 


Hs.119908


nucleolar protein NOP5/NOP58 
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AB038995 


Hs.114159 


RAB-8b protein 


5.04 


0.16 
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frizzled (Drosophila) homolog 6 


3.46 


0.24 
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AF019225 


Hs.114309 


apolipoprotein L


4.54 


0.14 
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AB022192 


Hs.115240


peroxisome biogenesis factor 13 
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Hs.1 15263 


epiregulin 


4.45 


0.06 
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AA309272 




gb:EST180209 Liver, hepatocellular carci


2.25 


0.19 
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422406 


AF025441 


Hs.116206 


Opa-interacting protein 5 


9.5 


0.07 
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422487 


AJ010901 
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mucin 4, tracheobronchial 


7.35 


0.04 
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422491 


AA338548 


Hs.117546


neuronatin 


0.64 


1.24 
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gb:EST182167 Jurkat T-cells V Homosapie 


3.6 
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Hs.1 241 65 
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Hs.117852 


ATP-binding cassette, sub-family D (ALD)
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0.14 
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AW972300 


Hs.118110 


bone marrow stromal cell antigen 2 
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Homo sapiens cDNA: FU23548 fis, clone L 
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gb:ESTl83651 Monocytes, stimulated II Ho 
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AA247778 


Hs.119155


Homo sapiens mRNA; cDNA DKFZp434B249 (fr
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0.07 


upregulate stage 


422762 


AL031320 


Hs.119976 


Human DNA sequence from clone RP1-20N2 o 


5.1 


0.13 
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AK001379 


Hs.121028 


hypothetical protein FLJ10549


5.94 


0.1 


upregulate stage 
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Hs.121102 


vanin 2


10 


0.07 
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422892 


AA988176 


Hs.121553


hypothetical protein FLJ20641
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0.27 
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Hs.1 594 


centromere protein A (17kD) 


7.2 
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Hs.255895 


ESTs 
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AF026445 


Hs.122752 


TATA box binding protein (TBP)-associate


3.05 


0.14 
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Hs.208603 


ESTs 
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0.09 
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BE387529 


Hs.123536 


melanoma antigen, family E, 1, cancer/te 
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gb:EST25873 Cerebellum II Homo sapiens c 
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2.8 
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AA131493 
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fibroblast growth factor 12B 
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0.27 
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Hs.1634 


cell division cycle 25A 
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Hs.1640 
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1JZ 
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frequently rearranged In advanced T-cell 
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Homo sapiens cDNA: FU22219 fis, clone H 
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HSPC135 protein 
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0.69 
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AA364195 




gb:EST75015 Pineal gland (1 Homo sapiens 
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ESTs 
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RAD54, S. cerevisiae, homolog of, B 
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CGI-09 protein 
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upregulate stage 


423500 


AF020763 
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gb:EST32358 Embryo, 12 week 1 Homo sapie 
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Homo sapiens cDNA FU1 1883 fis, clone HE 


8.35 


0.1 
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gb:EST32875 Embryo, 12 week I Homo sapie 


1 


0.43 
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AK000456 


Hs.130546


hypothetical protein FLJ20449


10.4 


0.07 
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AF192913 


Hs.130683


zinc finger protein 180 (HHZ168)


5.75 


0.1 
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Hs.35828 


ESTs 


3.15 


0.18 
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BE003054 


Hs.1695 


matrix metalloproteinase 12 (macrophage 


29.7 


0.02 


upregulate stage 
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AJ403108 


Hs.132127


hypothetical protein LOC57822 


4.1 


0.16 
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Hs.43222 


ESTs 
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upregulate stage 
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Al 149048 


Hs.30211 


hypothetical protein FLJ22313 


4.25 


0.13 


upregulate stage 
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Y11312 


Hs.132463


phosphoinositide-3-kinase, class 2, beta


1.18 
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Hs.82124 


laminin, beta 1 
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L39064 


Hs.1702 


interleukin 9 receptor 


3.08 


0.15 
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gb:EST36554 Embryo, 8 week 1 Homo sapien 
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ESTs 
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gb:PM3-DT0037-231299-001-g11 DT0037 Homo 
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AL049328 


Hs.135642


Homo sapiens mRNA; cDNA DKFZp564E026 (fr 


1 


1 


upregulate stage 


423942 


AF209704 


Hs.135723


glycolipid transfer protein 


11.65 


0.05 


upregulate stage 


423944 


T91433 


Hs.128291


phosphodiesterase 10A 


1 


0.45 


upregulate stage 


423946 


AL137344


Hs.135892


Homo sapiens mRNA; cDNA DKFZp761 11311 (f 


1 


1 


upregulate stage 


423956 


W28203 


Hs.136169


Homo sapiens clone 25215 mRNA sequence, 


5.35 


0.09 


upregulate stage 


424006 


AF054815 


Hs.137548


CD84 antigen (leukocyte antigen) 


8.8 


0.06 


upregulate stage 


424008 


R02740 


Hs.137555


putative chemokine receptor; GTP-binding 


3.14 


0.19 


upregulate stage 


424012 


AW368377 


Hs.137569


tumor protein 63 kDa with strong homolog


2.6 


0.26 


upregulate stage 


424073 


U03493 


Hs.138959


gap junction protein, alpha 7, 45kD (con 


1.8 


0.22 


upregulate stage 


424075 


AI807320 


Hs.227630 


RE1-silencing transcription factor


9.1 


0.06 


upregulate stage 


424087 


N69333


Hs.21638 


ESTs 


1 


1 


upregulate stage 


424193 


AK002005 


Hs.142868


Homo sapiens cDNA FLJ11143 fis, clone PL


1 


0.23 


upregulate stage 


424353 


AA339646 




gb:EST44755 Fetal brain I Homo sapiens c


1 


1 


upregulate stage 


424364 


AW383226 


Hs.201189 


ESTs, Weakly similar to DRPLA [H.sapiens


2.18 


0.33 


upregulate stage 


424406 


D54120 


Hs.146409


wingless-type MMTV integration site fami 


2.05 


0.17 


upregulate stage 


424420 


BE614743 


Hs.146688


prostaglandin E synthase 


1.19 


0.67 


upregulate stage 


424425 


AB031480 


Hs.146824


SPR1 protein 


1.42 


0.54 


upregulate stage 


424486 


BE002477


Hs.278714 


chloride intracellular channel 6 


1 


0.27 


upregulate stage 


424490 


AJ278016 


Hs.55565 


ankyrin repeat domain 3 


2.02 


0.39


upregulate stage 


424492 


AH 33482 


Hs.165210


ESTs 


3.15 


0.14 


upregulate stage 


424505 


AA446131 


Hs.124918 


Homo sapiens cDNA FLJ13186 fis, clone NT


11.55 


0.05 


upregulate stage 


424513 


BE385864 


Hs.149894


mitochondrial translational initiation f


265 


0.23 


upregulate stage 


424575 


AL110217 


Hs.150751


DKFZP572C163 protein


1 


1 


upregulate stage 


424583 


AF017445 


Hs.150926


fucose-1-phosphate guanylyltransferase


1.8 


0.26 


upregulate stage 


424589 


AW854298 




gb:RC3-CT0254-100500-211-c03 CT0254 Homo


0.85 


0.94 


upregulate stage 


424602 


AK002055 


Hs.301129 


Homo sapiens clone 23859 mRNA sequence 


285 


0.2 


upregulate stage 


424625 


AW904466 


Hs.151310 


PDZ domain protein (Drosophila inaD-like


1 


0.4 


upregulate stage 


424629 


M90656 


Hs.151393


glutamate-cysteine ligase, catalytic sub 


1.41 


0.52 


upregulate stage 


424643 


AF241850 


Hs.151428 


ret finger protein 2 


9.75 


0.07 


upregulate stage 


424649 


BE242035 


Hs.151461


embryonic ectoderm development 


5.85 


0.13 


upregulate stage 


424653 


AW977534 


Hs.151469


calcium/calmodulin-dependent serine prot


1 


0.56 


upregulate stage 


424670 


W61215 


Hs.116651


epithelial V-like antigen 1


1.42 


0.52 


upregulate stage 


424690 


BE538356 


Hs.151777


Human translation initiation factor eIF-


4.3 


0.1 


upregulate stage 


424701 


NM_005923


Hs.151988 


mitogen-activated protein kinase kinase 


3.85 


0.11 


upregulate stage 


424702 


AF250237 


Hs.152009


G protein-coupled receptor 85 


1 


1 


upregulate stage 


424717 


H03754 


Hs.152213


wingless-type MMTV integration site fami 


4.05 


0.09 


upregulate stage 


424735 


U31875 


Hs.152677


Homo sapiens cDNA FLJ20338 fis, clone HE


1.04 


0.59 


upregulate stage 


424736


AF230877 


Hs.152701


microtubule-interacting protein that ass


1.11 


0.65 


upregulate stage 


424749 


NM_002451


Hs.152817


methylthioadenosine phosphorylase 


1 


1 


upregulate stage 


424834 


AK001432 


Hs.153408 


Homo sapiens cDNA FLJ10570 fis, clone NT


18.5 


0.03 


upregulate stage 


424841 


AI280215


Hs.96885 


ESTs 


1 


1 


upregulate stage 


424860 


W60828 


Hs.153529 


Homo sapiens clone 24540 mRNA sequence 


1 


1 


upregulate stage 


424878 


H57111 


Hs.221132 


ESTs 


9.45 


0.07 


upregulate stage 


424879 


AA348013 


Hs.159354 


ESTs 


10.7 


0.07 


upregulate stage 


424888 


AA348126 


Hs.24882 


ESTs 


28 


0.21 


upregulate stage 


424905 


NM_002497


Hs.153704 


NIMA (never in mitosis gene a)-related k


7.75 


0.07


upregulate stage 


424930 


AA885344 


Hs.96910 


ESTs 


1.45 


0.38 


upregulate stage 


424948 


AA348810 


Hs.190503


ESTs 


3.2 


0.12 


upregulate stage 


424951 


AW964082 




gb:EST376155 MAGE resequences, MAGH Homo 


8.75 


0.09 


upregulate stage 


424993 


F07625 




gbiHSGzurUz i normanzea truant Drain cun 


1 


1 


upregulate stage 


425020 


U09368 


Hs.154205 


zinc finger protein 140 (clone pHZ-39) 


1 


1 


upregulate stage 


425024 


R39235 


Hs.12407 


ESTs 


265 


0.13 


upregulate stage 


425057 


AA826434 


Hs.96944 


ESTs 


1 


0.22 


upregulate stage 


425068 


AL048716 


Hs.154387 


KIAA0103 gene product


9.5 


0.07 


upregulate stage 


425081 


X74794 


Hs.154443 


minichromosome maintenance deficient (S. 


1.66 


0.52 


upregulate stage 


425191 


AF052146 


Hs.155085


Homo sapiens clone 24653 mRNA sequence


1 


0.32 


upregulate stage 


425216 


U81504 


Hs.155172 


adaptor-related protein complex 3, beta 


7.05 


0.1 


upregulate stage 


425234 


AW152225 


Hs.165909 


ESTs 


19.7 


0.04 


upregulate stage 


425239 


BE567924 


Hs.155244 


pre-mRNA splicing factor similar to S. c 


1 


0.69 


upregulate stage 


425289 


AW139342 


Hs.155530


interferon, gamma-inducible protein 16


10.45 


0.05 


upregulate stage 


425304 


AA463844 


Hs.31339 


fibroblast growth factor 11 


1.57 


0.51 


upregulate stage 


425316 


AA354977 


Hs.191565 


ESTs, Moderately similar to NSD1 protein


5.05 


0.12 


upregulate stage 
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425322 


U53630 


Hs.155637 


protein kinase, DNA-activated, catalytic


14.7 


0.05 


upregulate stage 


425362 


AA355936 




gb:EST64410 Jurkat T-cells VI Homo sapie


1 


1 


upregulate stage 


425397 


J04088 


Hs.155346 


topoisomerase (DNA) II alpha (170kD)


8.24 


0.09 


upregulate stage


425403 


AL023753 


Hs.155406 


Human DNA sequence from clone 1198H6 on


1 


0.22 


upregulate stage 


425415 


M13903 


Hs.157091


involucrin


1.19 


0.55 


upregulate stage 


425420 


BE536911 


Hs.234545 


ESTs, Weakly similar to AF155135_1 novel


285 


0.13 


upregulate stage 


425453 


AK000740 


Hs.157988 


hypothetical protein FLJ20733


9 


0.07 


upregulate stage 


425455 


L18964 


Hs.1904 


protein kinase C, iota 


9.6 


0.07 


upregulate stage 


425457 


R16484 


Hs.190075 


ESTs 


1 


0.83 


upregulate stage


425492 


AL021918 


Hs.158174 


zinc finger protein 184 (Kruppel-like)


3.2 


0.15 


upregulate stage 


425607 


U09860 


Hs.158333 


protease, serine, 7 (enterokinase) 


1 


1 


upregulate stage 


425608 


AA360486 


Hs.92448 


ESTs 


4.7 


0.14 


upregulate stage 


425614 


AI334963


Hs.156256 


ESTs 


2.65 


0.14 


upregulate stage 


425641 


079758 


Hs.14355 


Homo sapiens cDNA FLJ13207 fis, clone NT


4.85 


0.1 


upregulate stage 


425660 


AA521184


Hs.105504 


ESTs 


1 


0.31 


upregulate stage 
upregulate stage 


425665 


AK001050 


Hs.159066 


hypothetical protein FLJ10188


1.25 


0.19 


425672 


AA361483 




gb:EST70790 T-cell lymphoma Homo sapiens 


1 


1 


upregulate stage 


425707 


AF115402


Hs.11713 


E74-like factor 5 (ets domain transcript


3.96 


0.13 


upregulate stage 


425726 


AF085808 


Hs.159330 


uroplakin 3 


0.92 


0.79 


upregulate stage 


425742 


AJ001454 


Hs.159425


testican 3


1 


1 


upregulate stage 


425785 


T27017 


Hs.159528


Homo sapiens clone 24400 mRNA sequence 


1 


0.39 


upregulate stage 


425811 


AL039104 


Hs.159557 


karyopherin alpha 2 (RAG cohort 1, impor 


1.89 


0.44 


upregulate stage 


425843 


BE313280 


Hs.159627 


death associated protein 3 


3.1 


0.15 


upregulate stage 


425852 


AK001504 


Hs.159651 


death receptor 6 


1.72 


0.47 


upregulate stage 


425883 


AL137708


Hs.161031 


Homo sapiens mRNA; cDNA DKFZp434K0322 (f


0.95 


0.68 


upregulate stage 


426010 


AA136563 


Hs.1975 


Homo sapiens cDNA: FLJ21007 fis, clone C


1 


0.34 


upregulate stage 


426028 


NM_001110


Hs.172028 


a disintegrin and metalloproteinase doma


14.3 


0.04 


upregulate stage 


426101 


AL049987 


Hs.166361 


Homo sapiens mRNA; cDNA DKFZp564F112 (fr


11.75 


0.05 


upregulate stage 


426108 


AA622037 


Hs.166468 


programmed cell death 5 


3.23 


0.18 


upregulate stage 


426115 


H08895 


Hs.166733 


leucyl/cystinyl aminopeptidase


1 


0.32 


upregulate stage 


426168 


NM_003152


Hs.167503 


signal transducer and activator of trans 


1.97 


0.4 


upregulate stage 


426257 


AL137201 


Hs.168625 


KIAA0979 protein 


1 


0.29 


upregulate stage 


426261 


AW242243 


Hs.168670 


peroxisomal farnesylated protein 


2.8 


0.16 


upregulate stage 


426283 


NM_003937 


Hs.169139 


kynureninase (L-kynurenine hydrolase) 


14.75 


0.04 


upregulate stage 


426451 


AI908165


Hs.169946 


GATA-binding protein 3


3.05 


0.28 


upregulate stage 


426462 


U59111 


Hs.169993 


dermatan sulphate proteoglycan 3 


1 


0.36 


upregulate stage 


426490 


NM_001621


Hs.170087 


aryl hydrocarbon receptor 


14.17 


0.05 


upregulate stage 


426514 


BE616633 


Hs.301122 


bone morphogenetic protein 7 (osteogenic 


1.15 


0.47 


upregulate stage 


426561 


AA381437 




gb:EST94514 Activated T-cells 1 Homo sap 


5.65 


0.11 


upregulate stage 


426711 


AA383471 


Hs.180669 


conserved gene amplified in osteosarcoma 


11.05 


0.05


upregulate stage 


426731 


AW303411 


Hs.130332 


ESTs 


2.4 


0.21 


upregulate stage 


426759 


AI590401 


Hs.21213 


ESTs 


9.5


0.06 


upregulate stage 


426786 


AA319798 


Hs.172247 


eukaryotic translation elongation factor 


9.25 


0.09 


upregulate stage 


426788 


U68615 


Hs.172280 


SWI/SNF related, matrix associated, acti


5.63 


0.14 


upregulate stage 


426818 


AA554827 


Hs.124841 


ESTs, Weakly similar to ALU5_HUMAN ALU S


9 


0.08 


upregulate stage 


426824 


D87717 


Hs.172652 


KIAA0013 gene product


1 


0.87 


upregulate stage 


426827 


AW067805 


Hs.172665 


methylenetetrahydrofolate dehydrogenase 


8.95 


0.09 


upregulate stage 


426921 


AA037145 


Hs.172865 


cleavage stimulation factor, 3' pre-RNA,


1 


0.19 


upregulate stage 


426935 


NM_000088 


Hs.172928 


collagen, type 1, alpha 1 


1.15 


0.72 


upregulate stage 


426997 


BE620738 


Hs.173125


peptidylprolyl isomerase F (cyclophilin


11.06 


0.06 


upregulate stage 


427071 


AA397958 


Hs.192719 


ESTs 


5.75 


0.08 


upregulate stage 


427126 


AA620613 


Hs.191827 


ESTs 


2.55 


0.18 


upregulate stage 


427134 


AA398409 


Hs.173561 


EST 


3.4 


0.18 


upregulate stage 


427142 


AA398510 


Hs.133148 


ESTs 


1 


0.25 


upregulate stage 


427259 


AA400096 




gb:zu69fl)7.s1 Soares_testis_NHT Homo sap 


1 


0.22 


upregulate stage 


427308 


D26067 


Hs.174905 


KIAA0033 protein 


5.9 


0.1 


upregulate stage 


427315 


AA179949 


Hs.175563 


Homo sapiens mRNA; cDNA DKFZp564N0763 (f 


6.12 


0.11 


upregulate stage 


427356 


AW023482 


Hs.97849 


ESTs 


2.7 


0.13 


upregulate stage 


427370 


AI243615 


Hs.97740 


ESTs 


16 


0.14 


upregulate stage 


427376 


AA401533 


Hs.19440 


ESTs 


21 


0.16 


upregulate stage 


427387 


BE244966 


Hs.177584 


3-oxoacid CoA transferase 


1 


0.39 


upregulate stage 


427470 


AW999924 


Hs.178357 


Homo sapiens cDNA FLJ13657 fis, clone PL 


3.4 


0.16 


upregulate stage 


427519 


AW085233 


Hs.180696 


ESTs 


8.23 


0.1 


upregulate stage 


427521 


AW973352 


Hs.299056 


ESTs 


7.75 


0.1 


upregulate stage 


427528 


AU077143 


Hs.179565 


minichromosome maintenance deficient (S. 


5.7 


0.15 


upregulate stage 


427566 


AI743515 




gb:wf72b08.x2 Soares_NFL_T_GBC_S1 Homo s 


1 


1 


upregulate stage 


427581 


NM_014788 


Hs.179703 


KIAA0129 gene product 


11.45 


0.06 


upregulate stage 


427585 


D31152 


Hs.179729 


collagen, type X, alpha 1 (Schmid metaph 


3.25 


0.16 


upregulate stage 


427603 


AI090838 


Hs.98006 


ESTs 


1 


1 


upregulate stage 


427646 


AI678042 


Hs.271953 


ESTs 


5.7 


0.11 


upregulate stage 


427652 


AI673025 


Hs.43874 


ESTs 


1 


0.34 


upregulate stage 


427742 


AA411880 


Hs.190888 


ESTs 


24 


0.16 


upregulate stage 


427814 


W28383 


Hs.180900 


Williams-Beuren syndrome chromosome regi 


9.13 


0.08 


upregulate stage 


427839 


AA608823 


Hs.98244 


ESTs 


1.9 


0.19 


upregulate stage 


427878 


C05766 


Hs.181022 


CGI-07 protein 


4.1 


0.14 


upregulate stage 


427922 


AK001934 


Hs.181112 


HSPC126 protein 


26 


0.19 


upregulate stage 


427933 


AW974643 


Hs.190571 


ESTs 


4.55 


0.14 


upregulate stage 
427934 


AA810541 


Hs.291866 


ESTs 


1 


1 


upregulate stage 


427944 


AA417878 


Hs.48401 


ESTs, Weakly similar to ALU8_HUMAN ALU S 


6.15 


0.1 


upregulate stage 


427961 


AW293165 


Hs.143134 


ESTs 


4.65 


0.11 


upregulate stage 


427986 


N45214 


Hs.282387 


Homo sapiens cDNA: FLI2JB37 fis, clone H 


3.55 


0.13 


upregulate stage 


428003 


AL110200 


Hs.181384 


Homo sapiens mRNA; cDNA DKFZp586B0922 (f 


1.45 


0.36 


upregulate stage 


428004 


AA449563 


Hs.300270 


ESTs 


3.95 


0.12 


upregulate stage 


428010 


AA806554 


Hs.185375 


ESTs 


1 


0.38 


upregulate stage 


428057 


AI343641 


Hs.185798 


ESTs 


10.1 


0.06 


upregulate stage 


428058 


AI821625 


Hs.191602 


ESTs 


1 


0.5 


upregulate stage 


428071 


AF212848 


Hs.182339 


ets homologous factor 


6.4 


0.09 


upregulate stage 


428182 


BE386042 


Hs.293317 


ESTs, Weakly similar to JM27 [H.sapiens] 


1 


0.23 


upregulate stage 


428192 


AA424051 




gb:zv80d03.s1 Soares_total_fetus_Nb2HF8_ 


2.45 


0.16 


upregulate stage 


428227 


AA321649 


Hs.2248 


small inducible cytokine subfamily B (Cy 


9.25 


0.04 


upregulate stage 


428403 


AI393048 


Hs.239894 


leucine rich repeat (in FLII) interacts 


9.94 


0.06 


upregulate stage 


428436 


BE080180 




gb:RC4-BT0629-120200-011-b10 BT0629 Homo 


1 


1 


upregulate stage 


428450 


NM_014791 


Hs.184339 


KIAA0175 gene product 


4.43 


0.16 


upregulate stage 


428479 


Y00272 


Hs.184572 


cell division cycle 2, G1 to S and G2 to 


9.2 


0.07 


upregulate stage 


428529 


AW262022 


Hs.106278 


Homo sapiens cDNA FLJ12839 fis, clone NT 


1 


1 


upregulate stage 


428576 


AW009330 


Hs.167621 


ESTs 


1 


0.3 


upregulate stage 


428605 


AB037862 


Hs.188756 


KIAA1441 protein 


9.25 


0.09 


upregulate stage 


428664 


AK001666 


Hs.189095 


similar to SALL1 (sal (Drosophila)-like 


3.8 


0.08 


upregulate stage 


428685 


AF131853 


Hs.189527 


Homo sapiens clone 25016 mRNA sequence 


1 


1 


upregulate stage 


428716 


AL122118 


Hs.190614 


Homo sapiens mRNA; cDNA DKFZp43401221 (f 


1 


0.65 


upregulate stage 


428783 


AW070204 


Hs.178176 


ESTs 


1.6 


0.23 


upregulate stage 


428788 


AF082283 


Hs.193516 


B-cell CLL/lymphoma 10 


9.6 


0.08 


upregulate stage 


428829 


R14050 


Hs.194051 


Homo sapiens mRNA; cDNA DKFZp566B213 (fr 


5.45 


0.11 


upregulate stage 


428839 


AI767756 


Hs.82302 


ESTs 


10 


0.06 


upregulate stage 


428881 


AI298368 


Hs.98918 


ESTs 


1.4 


0.18 


upregulate stage 


428954 


AF100781 


Hs.194878 


WNT1 inducible signaling pathway protein 


1 


1 


upregulate stage 


428988 


AA442900 


Hs.27947 


ESTs 


3.05 


0.13 


upregulate stage 


429042 


AW015489 


Hs.235920 


ESTs 


1 


0.56 


upregulate stage 


429057 


AF156557 


Hs.194816 


stomatin-like protein 1 


0.95 


0.93 


upregulate stage 


429066 


AA868555 


Hs.178222 


ESTs 


6 


0.11 


upregulate stage 


429072 


AI376228 


Hs.108043 


Friend leukemia virus integration 1 


1 


1 


upregulate stage 


429083 


Y09397 


Hs.227817 


BCL2-related protein A1 


11.12 


0.03 


upregulate stage 


429091 


AA935658 


Hs.187939 


ESTs 


8.9 


0.08 


upregulate stage 


429115 


AA446728 


Hs.289020 


Homo sapiens cDNA FLJ14098 fis, clone MA 


4.1 


0.17 


upregulate stage 


429127 


AA749382 


Hs.107233 


ESTs 


1 


0.23 


upregulate stage 


429135 


AA446966 


Hs.99090 


ESTs, Moderately similar to similar to K 


1 


1 


upregulate stage 


429170 


NM_001394 


Hs.2359 


dual specificity phosphatase 4 


8.6 


0.08 


upregulate stage 


429174 


BE559598 


Hs.197803 


KIAA0160 protein 


8.4 


0.06 


upregulate stage 


429236 


AA448407 




gb:zw68d11.s1 Soares_testis_NHT Homo sap 


1 


0.36 


upregulate stage 


429268 


AA205386 


Hs.198481 


RAR-related orphan receptor B 


2.9 


0.16 


upregulate stage 


429300 


AB011108 


Hs.198891 


serine/threonine-protein kinase PRP4 hom 


4.25 


0.15 


upregulate stage 


429334 


D63078 


Hs.186180 


Homo sapiens cDNA: FLJ23038 fis, clone L 


2.95 


0.11 


upregulate stage 


429344 


R94038 


Hs.199538 


inhibin, beta C 


2.91 


0.28 


upregulate stage 


429359 


W00482 


Hs.2399 


matrix metalloproteinase 14 (membrane-in 


1.19 


0.68 


upregulate stage 


429376 


AI867889 


Hs.43227 


ESTs 


1 


1 


upregulate stage 


429412 


NM_006235 


Hs.2407 


POU domain, class 2, associating factor 


8.15 


0.07 


upregulate stage 


429450 


AA824451 


Hs.94292 


Homo sapiens cDNA: FLJ23311 fis, clone H 


3.3 


0.17 


upregulate stage 
Homo sapiens cDNA FLJ13841 fis, clone TH 
ESTs 
ESTs 

roundabout (axon guidance receptor, Dros 

gb:aa32c1U1 NCI_CGAP_GCB1 Homo sapiens 

ESTs, Weakly similar to contains similar 

contain 5 

ESTs 

ESTs 

ESTs, Weakly similar to tumor suppressor 
CGI-09 protein 

serine carboxypeptidase 1 precursor prot 
gb:PM1-HT0527-280200-005-a05 HT0527 Homo 
nuclear cap binding protein subunit 2, 2 
chloride channel, calcium activated, fam 
ESTs, Moderately similar to tetracycline 
U6 snRNA-associated Sm-like protein LSm8 
F-box only protein 4 
hypothetical protein FLJ20055 
gb:RC1-BT0314-310300-015-b06 BT0314 Homo 
ESTs 

hypothetical protein FLJ10902 

cystatin A (stefin A) 

ESTs 

S100 calcium-binding protein A11 (calgiz 
gb:zv29c06j1 Soares ovary tumor NbHOT H 
ESTs 

Human DNA sequence from clone 336H9 on c 

ESTs 

ESTs 
ESTs 

ribosomal protein L27a 
5-hydroxytryptamine (serotonin) receptor 
ESTs 

gap junction protein, beta 6 (connexin 3 
transcription factor 19 (SC1) 
ESTs 
ESTs 

ESTs 
ESTs 

ESTs 
ESTs 
insulin-like growth factor 2 (somatomedi 
ESTs 

hypothetical protein DKFZp56401278 

ESTs 

ESTs 

hypothetical protein FLJ10842 
splicing factor proline/glutamine rich ( 
Sec23 (S. cerevisiae) homolog A 
hypothetical protein FLJ10110 
Apg12 (autophagy 12, S. cerevisiae)-like 
integrin, alpha 3 (antigen CD49C, alpha 
TBX3-iso protein 
UDP-Gal:betaGlcNAc beta 1,3-galactosyltr 
ESTs 
nuclear receptor coactivator 4 
ESTs 
ESTs 
ESTs 
ESTs 

ESTs 

interleukin 1 receptor accessory protein 
protein phosphatase 4 (formerly X), cata 
ESTs 
ESTs 
hypothetical protein FLJ20413 
ESTs 

hypothetical protein FLJ11064 
matrix metalloproteinase 13 (collagenase 
hypothetical protein FLJ10377 
ESTs 
S100 calcium-binding protein P 
Sp4 transcription factor 

ESTs 
granzyme K (serine protease, granzyme 3; 
ESTs 
ESTs, Weakly similar to unnamed protein 
ESTs 

ER-associated DNAJ; ER-associated Hsp40 
ESTs 
ESTs 
MARK 
ESTs 
BE281165 Hs.288038 TLS-associated serine-arginine protein 1 
AA608955 Hs.109653 ESTs 
AF113017 Hs.284301 PRO1268 protein 
442216 AI733468 Hs.129006 ESTs 

442295 AI827248 Hs.224398 Homo sapiens cDNA FLJ11469 fis, clone HE 
442319 BE048144 Hs.177677 CGM02 protein 
442432 BE093589 Hs.38178 Homo sapiens cDNA FLJ23468 fis, clone H 

442510 AF150179 Hs.249890 ESTs 

442518 AF150226 gb:AF150226 Human mRNA from cd34+ stem c 

442539 AL119505 Hs.58220 Homo sapiens cDNA: FLJ23005 fis, clone L 
442552 R20624 Hs.83572 son of sevenless (Drosophila) homolog 1 
442562 BE379584 Hs.34789 ESTs 
442564 AI590207 Hs.188378 ESTs 
442577 AA292998 Hs.163900 ESTs 
442590 AI002686 Hs.130313 ESTs 
442597 AM99214 Hs.130825 ESTs 

442611 BE077155 Hs.177537 ESTs 

442612 AJ005233 Hs.130631 ESTs 

442619 AA447492 Hs.20183 ESTs, Weakly similar to AF164793 1 prote 
442642 R51853 Hs.226429 ESTs 
442660 AW138174 Hs.130651 ESTs 

442696 BE566962 Hs.7063 Homo sapiens cDNA: FLJ20913 fis, clone A 
442712 BE465168 Hs.131011 ESTs 

442760 BE075297 Hs.10067 ESTs, Weakly similar to KIAA1205 protein 
442769 AW243058 Hs.131155 ESTs 
442785 AW296625 Hs.131188 ESTs 
442806 AW294522 Hs.149991 ESTs 

442856 H56735 Hs.282958 Homo sapiens cDNA FLJ13611 fis, clone PL 

442861 AA243837 Hs.57787 ESTs 

442875 BE623003 Hs.23625 Homo sapiens clone TCCCTA00142 mRNA sequ 

442879 AF032922 Hs.8813 syntaxin binding protein 3 

442883 AW195774 Hs.253199 ESTs 

442961 BE614474 Hs.289074 Homo sapiens cDNA FLJ13986 fis, clone Y7 

442966 AI394036 Hs.132237 ESTs, Weakly similar to dual specificity 

442980 AA857025 Hs.8878 kinesin-like 1 

442992 AI914699 Hs.13297 ESTs 

442994 AI026718 Hs.16954 ESTs 

443054 AI745185 Hs.8939 yes-associated protein 65 kDa 

443113 AHM0686 Hs.132908 ESTs 

443119 AA312264 Hs.7980 ESTs, Moderately similar to ALU4_HUMAN A 

443171 BE281128 Hs.9030 TONDU 

443211 AI128388 Hs.143655 ESTs 

443242 BE243910 Hs.9082 nucleoporin p54 

443243 AI452496 Hs.132656 ESTs 

443247 BE614387 Hs.47378 ESTs, Moderately similar to hypothetical 

443270 NM_004272 Hs.9192 Homer, neuronal immediate early gene, 1B 

443299 AI733642 Hs.133042 ESTs 

443362 AI053464 Hs.166505 ESTs 

443383 AI792453 Hs.166507 ESTs 

443411 AW134566 Hs.65320 ESTs 

443426 AF098158 Hs.9329 chromosome 20 open reading frame 1 

443447 AI094222 Hs.166572 ESTs 

443542 AI927065 Hs.146040 ESTs 

443557 AV645987 Hs.145681 ESTs 

443584 AI807036 Hs.101619 ESTs 

443606 AI078664 Hs.199424 ESTs 

443634 H73972 Hs.134460 ESTs 

443640 AI872643 Hs.134218 ESTs 

443715 AI583187 Hs.9700 cyclin E1 

443799 AA150320 Hs.9800 protein kinase Njmu-R1 

443899 AW842283 Hs.79933 cyclin I 

443917 AW503739 Hs.72325 Human DNA sequence from clone RP1-187J11 

443919 AI091284 Hs.135224 ESTs 

443967 AW294013 Hs.200942 ESTs 

443977 AL120986 Hs.150627 ESTs 

443979 AV647366 Hs.282365 ESTs 

444020 R92962 Hs.35052 ESTs 

444105 AW189097 Hs.166597 ESTs 

444129 AW294292 Hs.256212 ESTs 

444152 AI125694 Hs.149305 Homo sapiens cDNA FLJ14264 fis, clone PL 

444163 AI126098 gb:qc54g07.x1 Soares_placenta_8to9weeks 

444166 AV648429 Hs.282393 ESTs 

444270 AI138580 Hs.255220 EST 

444271 AW452569 Hs.149804 ESTs 

444282 AI138955 gb:qd79b07.x1 Soares_testis_NHT Homo sap 

444333 AI262567 Hs.253801 trinucleotide repeat containing 15 

444342 NM_014398 Hs.10837 similar to lysosome-associated membrane 

444378 R41339 Hs.12569 ESTs 

444381 BE387335 Hs.283713 ESTs, Weakly similar to CA54_HUMAN COLLA 

444431 AW513324 Hs.42280 ESTs 
upregulate stage 


446127 


AA333608 


Hs.13980 


ubiquitously transcribed tetratricopepti 


1 


0.25 


upregulate stage 


446152 


AI292036 


Hs.150026 


ESTs 


4.7 


0.12 


upregulate stage 


446196 


AI744888 


Hs.149470 


ESTs 


1 


0.83 


upregulate stage 


446229 


AI744964 


Hs.14449 


KIAA1609 protein 


2.4 


0.36 


upregulate stage 


446248 


AI283014 


Hs.149638 


ESTs 


1 


1 


upregulate stage 


446269 


AW263155 


Hs.14559 


hypothetical protein ruiuiwu 


1U.H5 


0.07 


upregulate stage 


446303 


X77244 


Hs.14732 


malic enzyme 1, NADP(+)-dependent, cytos 


1 


1 


upregulate stage 


446312 


BE087853 




gb:QV1-BT0681-290400-181-h05 BT0681 Homo 


11.75 


0.06 


upregulate stage 


446332 


AK001635 


Hs.14638 


hypothetical protein FLJ10773 


6.45 


0.1 


upregulate stage 


446356 


AI816736 


Hs.14896 


DHHC1 protein 


8.9 


0.08 


upregulate stage 


446362 


AW612481 


Hs.255914 


ESTs 


7.6 


0.1 


upregulate stage 


446398 


AI681317 


Hs.150074 


ESTs 


1 


1 


upregulate stage 


446411 


AI298828 


Hs.153439 


ESTs 


1 


0.37 


upregulate stage 


446474 


AI301227 


Hs.150186 


ESTs 


3.35 


0.13 


upregulate stage 


446501 


AI302616 


Hs.150819 


ESTs 


4.25 


0.12 


upregulate stage 


446507 


AA352554 


Hs.15164 


nuclear DNA-binding protein 


11.25 


0.06 


upregulate stage 


446526 


H89616 


Hs.296290 


Homo sapiens cDNA FLJ13357 fis, clone PL 


10.25 


0.07 


upregulate stage 


446555 


AV659046 


Hs.201847 


ESTs 


1 


1 


upregulate stage 
446577 AB040933 
446629 AI436046 
446636 AC002563 
446682 AW205632 
446701 AK001621 
446718 AV660019 
446719 W39500 
446720 AI439136 
446765 AV660348 
446771 AA128965 
446821 W03766 
446830 BE179030 
446839 BE091926 
446853 AV660630 
446880 AI811807 
446922 BE175605 
446950 AA305800 

446988 AW339533 

446989 AK001898 
447052 AV651911 
447065 AI829014 
447069 AI359927 
447078 AW885727 

447080 AI418781 

447081 Y13896 
447118 AB014599 

447153 AA805202 

447154 H52284 
447159 AI685286 
447215 BE617056 
447228 AW192200 
447258 BE047911 
447286 AW197097 
447289 AW247017 
447334 AA515032 

447342 AI199268 

447343 AA255641 
447376 AI376747 
447397 BE247676 
447430 AI742989 
447444 AK000318 
447519 U46258 
447522 BE143888 
447578 AA912347 
447606 AI588954 
447688 N87079 
447701 BE619526 
447741 AI421737 
447748 AI422023 
447827 U73727 
447881 BE620886 
447963 AI452973 

447977 AI457097 

447978 AI457098 
447982 H22953 
448032 AW511770 
448045 AJ297436 
448058 AI458998 
448062 AW295923 
448138 AW847925 
448154 AL120320 
448165 NM_005591 
448168 AW605999 
448236 AA890449 
448256 BE614149 
448289 AW390251 

448356 AL120837 

448357 N20169 
448408 AA322866 
448455 AI252625 
448459 AW069838 
448454 AI522053 
448468 BE550361 
448502 AW805285 
448552 AW973653 
448556 AW885606 
448559 BE382657 



Hs.15420 KIAA1500 protein 

Hs.156148 Homo sapiens cDNA: FLJ23082 fis, clone L 

Hs.15767 citron (rho-interacting, serine/threonin 

Hs.211198 ESTs 

Hs.15921 hypothetical protein FLJ10759 

Hs.282676 ESTs 

Hs.47305 ESTs, Weakly similar to LONN_HUMAN MITOC 

Hs.140546 ESTs 

Hs.282688 ESTs 

Hs.60679 TATA box binding protein (TBP)-associate 

Hs.301482 ESTs 

Hs.64239 Human DNA sequence from clone RP5-1174N9 

Hs.16244 mitotic spindle coiled-coil related prot 

Hs.87627 disrupter of silencing 10 

Hs.108646 Homo sapiens cDNA FLJ12534 fis, clone NT 

gb:RC5-HT0580-10050(M)22^07 HT0580 Homo 

Hs.293454 ESTs, Weakly similar to Similarity to Ye 

Hs.272108 ESTs 

Hs.16740 hypothetical protein FLJ11036 

Hs.282735 ESTs 

Hs.158678 ESTs 

Hs.157722 ESTs 

Hs.301570 ESTs 

Hs.300144 ESTs 

Hs.17287 potassium inwardly-rectifying channel s 

Hs.17411 KIAA0699 protein 

Hs.173912 eukaryotic translation initiation factor 

Hs.293545 ESTs 

Hs.280386 EST 

Hs.283000 ESTs 

Hs.158188 ESTs 

gb:ta44a05.y1 NCI_CGAP_Brn52 Homo sapien 

Hs.183858 transcriptional intermediary factor 1 

Hs.36978 melanoma antigen, family A, 3 

Hs.91109 ESTs 

Hs.19322 ESTs 

Hs.236894 ESTs, Highly similar to LRP1_HUMAN LOW-D 

gb:tc35h05.x1 Soares_total_fetus_Nb2HF8 

Hs.18442 E-1 enzyme 

Hs.206112 ESTs 

Hs.18616 hypothetical protein FLJ20311 

Hs.23448 ESTs 

gb:Ml^HT0165-081199401-b04 HT0165 Homo 

Hs.136585 ESTs 

Hs.170995 ESTs 

Hs.19236 NADH dehydrogenase (ubiquinone) 1 beta s 

Hs.255527 ESTs, Weakly similar to unnamed protein 

Hs.167253 ESTs 

Hs.161338 ESTs 

Hs.19718 protein tyrosine phosphatase, receptor t 

Hs.23037 ESTs 

Hs.165900 ESTs, Weakly similar to ALUC_HUMAN !!!! 

Hs.255906 ESTs 

Hs.280848 ESTs 

Hs.137551 ESTs 

Hs.246868 ESTs 

Hs.20166 prostate stem cell antigen 

Hs.170424 ESTs 

Hs.255472 ESTs 

Hs.170736 ESTs 

Hs.203230 ESTs 

Hs.202379 meiotic recombination (S. cerevisiae) 11 

Hs.22549 hypothetical protein FLJ12799 

Hs.20766 oxysterol 7alpha-hydroxylase 

Hs.20814 CGI-27 protein 

Hs.202402 ESTs 

Hs.20993 high-glucose-regulated protein 8 

Hs.108923 ESTs 

Hs.21107 neuroligin 

Hs.269860 ESTs 

Hs.171055 ESTs 

Hs.196093 ESTs 

Hs.171072 ESTs 

Hs.239699 ESTs 

Hs.20104 hypothetical protein FLJ00052 

Hs.5064 ESTs 

Hs.21486 signal transducer and activator of trans 
44B532 


BE614269 




gb:601504311T1 NIH_MGC_71 Homo sapiens c 


1 


1 


upregulate stage 


448643 


AI557531 




gb:pl2.1-06.D06.r tumor2 Homo sapiens cD 


3.6 


0.14 


upregulate stage 


448649 


T94590 


Hs.222855 


ESTs 


1.95 


0.21 


upregulate stage 


448663 


BE614599 


Hs.106823 


H.sapiens gene from PAC 426I6, similar t 


4.3 


0.12 


upregulate stage 


448680 


AW245890 


Hs.21753 


JM5 protein 


0.97 


0.93 


upregulate stage 


448725 


AA193251 


Hs.40289 


ESTs 


2.6 


0.19 


upregulate stage 


448729 


BE614535 


Hs.138580 


ESTs, Weakly similar to ALU5_HUMAN ALU S 


3.25 


0.16 


upregulate stage 


448743 


AB032962 


Hs.21896 


KIAA1136 protein 


1.9 


0.19 


upregulate stage 


448826 


AI580252 


Hs.293246 


ESTs, Weakly similar to putative p150 (H 


1.78 


0.44 


upregulate stage 


448914 


AI927656 


Hs.196459 


ESTs 


2.75 


0.19 


upregulate stage 


448946 


AI652655 


Hs.155796 


ESTs 


9.7 


0.07 


upregulate stage 


448958 


AB020651 


Hs.22653 


KIAA0844 protein 


1 


0.18 


upregulate stage 


448974 


AL049390 


Hs.22689 


Homo sapiens mRNA; cDNA DKFZp58601318 (f 


5.85 


0.11 


upregulate stage 


448979 


AI611378 


Hs.192610 


ESTs 


1 


1 


upregulate stage 


449008 


AW578003 


Hs.22826 


tropomodulin 3 (ubiquitous) 


5.2 


0.11 


upregulate stage 


449032 


AA045573 


Hs.22900 


nuclear factor (erythroid-derived 2)-lik 


1 


0.33 


upregulate stage 


449053 


AI625777 


Hs.270344 


ESTs 


5.73 


0.12 


upregulate stage 


449057 


AB037784 


Hs.22941 


KIAA1363 protein 


9.25 


0.07 


upregulate stage 


449148 


AW836677 


Hs.287564 


Homo sapiens cDNA FLJ13345 fis, clone OV 


7.2 


0.09 


upregulate stage 


449203 


AI634578 


Hs.282121 


ESTs 


7 


0.1 


upregulate stage 


449207 


AL044222 


Hs.23255 


nucleoporin 155kD 


2.34 


0.36 


upregulate stage 


449219 


AI637561 


Hs.195012 


ESTs 


1 


1 


upregulate stage 


449230 


BE613348 


Hs.23348 


S-phase kinase-associated protein 2 (p45 


3.08 


0.25 


upregulate stage 


449246 


AW411209 


Hs.23363 


hypothetical protein FLJ10983 


4.79 


0.16 


upregulate stage 


449318 


AW236021 


Hs.108788 


ESTs, Weakly similar to zeste [D.melanog 


2.8 


0.16 


upregulate stage 


449328 


AI962493 


Hs.197647 


ESTs 


2.55 


0.17 


upregulate stage 


449343 


AM51418 


Hs.272458 


protein phosphatase 3 (formerly 2B), cat 


4.75 


0.12 


upregulate stage 


449344 


AI640355 




gb:wa17c04.x1 NCI_CGAP_Kid11 Homo sapien 


2.1 


0.22 


upregulate stage 


449351 


AW016537 


Hs.200760 


ESTs 


2.45 


0.14 


upregulate stage 


449370 


AK002114 


Hs.23495 


hypothetical protein FLJ11252 


1.55 


0.14 


upregulate stage 


449424 


AW448937 


Hs.197030 


ESTs 


4.05 


0.12 


upregulate stage 


449425 


AW103433 


Hs.195684 


ESTs 


4.6 


0.12 


upregulate stage 


449434 


AW294858 


Hs.197641 


ESTs 


1 


0.29 


upregulate stage 


449437 


AI702038 


Hs.100057 


Homo sapiens cDNA: FLJ22902 fis, clone K 


2.38 


0.34 


upregulate stage 


449474 


AA019344 


Hs.2055 


ubiquitin-activating enzyme E1 (A1S9T an 


5.9 


0.12 


upregulate stage 


449523 


NM_000579 


Hs.54443 


chemokine (C-C motif) receptor 5 


6.45 


0.1 


upregulate stage 


449528 


H63337 


Hs.38178 


Homo sapiens cDNA: FLJ23468 fis, clone H 


2.85 


0.18 


upregulate stage 


449565 


AI824925 


Hs.197066 


ESTs 


1 


1 


upregulate stage 


449568 


AL157479 


Hs.23740 


KIAA1598 protein 


10.15 


0.05 


upregulate stage 


449618 


AI076459 


Hs.14366 


Homo sapiens cDNA FLJ12819 fis, clone NT 


11.7 


0.06 


upregulate stage 


449639 


AA001968 


Hs.59956 


ESTs, Highly similar to MGR7_HUMAN METAB 


1 


1 


upregulate stage 


449666 


AA002047 




gb:zh84e05.r1 Soares_fetal_liver_spleen_ 


1.85 


0.33 


upregulate stage 


449704 


AK000733 


Hs.23900 


GTPase activating protein 


2.82 


0.3 


upregulate stage 


449722 


BE280074 


Hs.23960 


cyclin B1 


6.44 


0.12 


upregulate stage 


449764 


N93104 


Hs.54895 


ESTs, Weakly similar to ZNF91L [H.sapien 


1 


1 


upregulate stage 


449784 


AW161319 


Hs.12915 


ESTs 


6.25 


0.11 


upregulate stage 


449829 


N51440 


Hs.47261 


ESTs 


1 


0.57 


upregulate stage 


449843 


R85337 


Hs.24030 


solute carrier family 31 (copper transpo 


10.2 


0.07 


upregulate stage 


449892 


N73608 


Hs.50309 


ESTs 


6.5 


0.1 


upregulate stage 


449894 


AK001578 


Hs.24129 


hypothetical protein FLJ10716 


4.55 


0.12 


upregulate stage 


449919 


AI674685 


Hs.200141 


ESTs 


5.3 


0.11 


upregulate stage 


450020 


AI680684 


Hs.282219 


ESTs 


1 


1 


upregulate stage 


450033 


R43010 


Hs.269452 


ESTs, Weakly similar to JH0148 nucleolin 


1 


0.65 


upregulate stage 


450063 


AI681509 


Hs.277133 


ESTs 


4.2 


0.17 


upregulate stage 


450083 


AA131795 


Hs.142001 


ESTs 


3.9 


0.16 


upregulate stage 


450116 


AA005355 


Hs.222882 


ESTs 


1 


1 


upregulate stage 


450121 


AL040174 


Hs.288927 


Homo sapiens cDNA: FLJ22944 fis, clone K 


1 


1 


upregulate stage 


450135 


AI810816 


Hs.201142 


ESTs 


4.95 


0.14 


upregulate stage 


450144 


T63961 


Hs.301851 


ESTs 


2.75 


0.13 


upregulate stage 


450149 


AW969781 


Hs.293440 


ESTs, Moderately similar to ZIC2 protein 


3.75 


0.14 


upregulate stage 


450151 


AI088196 


Hs.295233 


ESTs 


2.51 


0.28 


upregulate stage 


450152 


AI138635 


Hs.22968 


ESTs 


2.45 


0.15 


upregulate stage 


450195 


AA007352 


Hs.256042 


ESTs 


4.1 


0.14 


upregulate stage 


450221 


AA328102 


Hs.24641 


cytoskeleton associated protein 2 


1 


0.3 


upregulate stage 


450238 


T89693 


Hs.138777 


ESTs 


12.2 


0.07 


upregulate stage 


450257 


AW820313 




gb:GV2-ST0296-150200-028-d02 ST0295 Homo 


1 


1 


upregulate stage 


450313 


AI038989 


Hs.24809 


hypothetical protein FLJ10826 


4.35 


0.15 


upregulate stage 


450314 


AA574309 


Hs.283402 


TCReta 


10.1 


0.07 


upregulate stage 


450350 


T97817 


Hs.174880 


ESTs 


3.65 


0.1 


upregulate stage 


450411 


D61167 


Hs.202156 


ESTs 


1 


0.67 


upregulate stage 


450447 


AF212223 


Hs.25010 


hypothetical protein P15-2 


10.75 


0.07 


upregulate stage 


450448 


D54299 


Hs.36244 


ESTs 


1 


1 


upregulate stage 


450449 


AI696596 


Hs.202068 


ESTs 


1 


1 


upregulate stage 


450506 


NM_004460 


Hs.418 


fibroblast activation protein, alpha 


11.45 


0.05 


upregulate stage 


450573 


AW964334 




gb:EST376407 MAGE resequences, MAGH Homo 


1.2 


0.2 


upregulate stage 


450628 


AW382884 


Hs.204715 


ESTs 


4.95 


0.13 


upregulate stage 


450636 


AI703076 


Hs.201959 


ESTs 


1 


0.69 


upregulate stage 
450555 
450664 
450680 
450722 
450751 
450772 
450800 
450824 
450832 
450870 
450937 
450983 
451052 
451067 
451088 
451094 
451096 
451126 
451161 
451166 
451222 
451225 
451228 
451246 
451266 
451276 
451277 
451291 
451326 
451347 
451359 
451365 
451386 
451440 
451487 
451492 
451495 
451535 
451553 
451562 
451580 
451592 
451651 
451658 
451684 
451690 
451724 
451743 
451794 
451844 
451903 
451914 
451938 
451939 
451971 
451998 
452028 
452036 
452099 
452122 
452163 
452179 
452198 
452206 
452234 
452240 
452247 
452250 
452256 
452266 
452277 
452281 
452291 
452328 
452331 
452345 
452367 



AI707846 
AA808358 
AF131784 
AI732318 
AI733251 
BE326391 
BE395161 
R09055 
AW970602 
AA011471 
R49131 
AA305384 
AA281504 
BE172186 
AA015600 
AI949825 
BE383234 
H30600 
AA211329 
T98171 
AA018386 
AI433694 
AJ767166 
AW189232 
AA016292 
AW294386 
AK001123 
R39288 
AW296946 
AI288679 
H85334 
AI791783 
AB029006 
AA017599 
AA018072 
AA018119 
H86887 
AW970577 
AA018454 
H04150 
AW138195 
AI805416 
AI097337 
AW195351 
AF216751 
AW451469 
AI903765 
AW074266 
AA019799 
T61430 
W19617 
AI822115 
AI354355 
U80456 
AA021185 
AW594129 
AK001859 
NM_003966 
BE612992 
AF216833 
AI863140 
H16725 
AI097560 
AW340281 
AW084176 
AI591147 
AL137432 
BE618654 
AK000933 
AI767250 
AL049013 
T93500 
AF015592 
AA805679 
AA598509 
AA293279 
U71207 



Hs.279860 
Hs.36830 
Hs.25318 
Hs.101120 
Hs.126853 
Hs.280146 
Hs.243963 
Hs.269204 
Hs.105421 

Hs.26267 
Hs.25740 
Hs.24444 
Hs.180789 
Hs.82415 
Hs.260395 
Hs.25925 
Hs.40910 
Hs.26006 
Hs.185675 
Hs.64341 
Hs.293608 
Hs.207025 
Hs.39140 
Hs.290849 
Hs.236533 
Hs.26176 
Hs.6702 
Hs.300967 
Hs.101139 



Hs.26334 
Hs.293817 

Hs.297824 



Hs.269211 

Hs.107708 

Hs.184326 

Hs.213897 

Hs.88977 

Hs.250520 

Hs.26813 

Hs.209990 

Hs.23071 
Hs.111911 

Hs.261003 

Hs.270618 

Hs.16697 

Hs.27311 

Hs.226306 

Hs.213666 

Hs.27595 

Hs.27621 

Hs.27931 

Hs.1710 

Hs.27463 

Hs.61210 

Hs.33074 

Hs.223296 

Hs.61232 

Hs.28564 

Hs.28607 

Hs.28661 

Hs.165240 

Hs.28783 

Hs.28792 

Hs.28853 

Hs.61271 

Hs.29117 

Hs.29173 

Hs.29279 



hypothetical protein FLJ20030 
ESTs 

Homo sapiens clone 25194 mRNA sequence 
ESTs 

ESTs, Weakly similar to JU0033 hypotheti 
ESTs, Weakly similar to JU0033 hypotheti 
ESTs, Weakly similar to ALU5_HUMAN ALU S 
ESTs 
ESTs 

gb:zi01h01.r1 Soares_fetal_liver_spleen_ 
ATP-dependant interferon response protei 
ERO1 (S. cerevisiae)-like 
ESTs, Moderately similar to ALUE_HUMAN ! 
S164 protein 
ESTs 
ESTs 

Homo sapiens clone 23860 mRNA sequence 
ESTs 

hypothetical protein FLJ10559 
ESTs 
ESTs 
ESTs 
ESTs 
2 
0.12 
a2 
Z3 
£15 
1 
452046 


AB018345 


Hs.27657 


KIAA0802 protein 


1.13 


0.39 


upregulate stage 


452092 


BE245374 


Hs.27842 


hypothetical protein FLJ11210 


3.2 


0.15 


upregulate stage 


452278 


AL037715 


Hs.28785 


Homo sapiens mRNA; cDNA DKFZp586F0219 (f 


8.25 


0.07 


upregulate stage 


452381 


H23329 




ESTs, Weakly similar to ALU1_HUMAN ALU S 


1 


0.34 


upregulate stage 


452420 


BE564871 


Hs.29463 


centrin, EF-hand protein, 3 (CDC31 yeast 


4.97 


0.13 


upregulate stage 


452714 


AW770994 


Hs.30340 


hypothetical protein KIAA1165 


7.6 


0.09 


upregulate stage 


453078 


AF053551 


Hs.31584 


metaxin 2 


5.3 


0.09 


upregulate stage 


453370 


AI470523 


Hs.182356 


ESTs, Moderately similar to translation 


3.78 


0.13 


upregulate stage 


453765 


BE279901 


Hs.35091 


hypothetical protein FLJ10775 


3.95 


0.11 


upregulate stage 


453972 


AW137224 


Hs.245869 


ESTs 


6 


0.09 


upregulate stage 


454044 


AW022393 




gb:df37h12.y1 Morton Fetal Cochlea Homo 


1.15 


0.18 


upregulate stage 


454289 


AL137554 


Hs.49927 


Homo sapiens mRNA; cDNA DKFZp434H1720 (f 


7.05 


0.1 


upregulate stage 


454314 


AW364844 




gb:QV3-DT0044-221299-045-c03 DT0044 Homo 


1 


0.37 


upregulate stage 


454315 


AW373564 


Hs.251928 


nuclear pore complex interacting protein 


2.7 


0.13 


upregulate stage 


454775 


BE160229 




gb:QV1-HT0413-090200-062-a12 HT0413 Homo 


8.5 


0.09 


upregulate stage 


454790 


AW820852 




gb:RC2-ST0301-120200-011-f12 ST0301 Homo 


1.15 


0.14 


upregulate stage 


454792 


AW820794 


Hs.252406 


hypothetical protein FLJ12296 similar to 


3.65 


0.12 


upregulate stage 


455170 


AW860972 




gb:QV0-CT0387-180300-167-h07 CT0387 Homo 


5.7 


0.07 


upregulate stage 


455511 


BE144762 




gb:CM0-HT0180-041099-065-b04 HT0180 Homo 


1 


0.25 


upregulate stage 


456141 


AI751357 


Hs.288741 


Homo sapiens cDNA: FLJ22256 fis, clone H 


8.35 


0.09 


upregulate stage 


456258 


AW976410 


Hs.289069 


Homo sapiens cDNA: FLJ21016 fis, clone C 


4.85 


0.14 


upregulate stage 


456279 


AW006783 


Hs.6686 


ESTs 


7.25 


0.1 


upregulate stage 


457518 


AA825350 


Hs.143805 


ESTs, Weakly similar to ALU INHUMAN ALU S 


6.84 


0.11 


upregulate stage 


457570 


AA579426 


Hs.190226 


ESTs 


2.6 


0.2 


upregulate stage 


457982 


AW856093 


Hs.183617 


ESTs 


1 


0.25 


upregulate stage 


458080 


BE142728 




gb:Mf^HT0157-021299-004-d08 HT0157 Homo 


2.05 


0.27 


upregulate stage 


458340 


AI457102 


Hs.121583 


Human glucose transporter pseudogene 


2.25 


0.18 


upregulate stage 


458440 


AI095468 


Hs.135254 


ESTs, Weakly similar to thrombospondin t 


2.35 


0.13 


upregulate stage 


458771 


AW295151 


Hs.163612 


ESTs 


1 


0.19 


upregulate stage 


459092 


AA722012 


Hs.255757 


ESTs, Weakly similar to KIAA061 1 protein 


6.95 


0.1 


upregulate stage 



TABLE 1B 



Pkey: Unique Eos probeset identifier number 
CAT number: Gene cluster number 
Accession: Genbank accession numbers 
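
By way of illustration only, the three columns described in the legend above can be read into a simple record. The following Python sketch is not part of the disclosed methods; the names `ClusterRow` and `parse_row` are hypothetical, and the sketch assumes that each row is whitespace-separated and that the CAT number is written as a single token (for example `103087_1`).

```python
from dataclasses import dataclass
from typing import List


@dataclass
class ClusterRow:
    pkey: int              # Eos probeset identifier
    cat: str               # gene cluster (CAT) number, kept as text
    accessions: List[str]  # Genbank accession numbers in the cluster


def parse_row(line: str) -> ClusterRow:
    """Split one whitespace-separated row into Pkey, CAT and accessions."""
    fields = line.split()
    if len(fields) < 3:
        raise ValueError("expected Pkey, CAT number and at least one accession")
    return ClusterRow(pkey=int(fields[0]), cat=fields[1], accessions=fields[2:])


# Example row; the single-token CAT spelling is an assumption of this sketch.
row = parse_row("407980 103087_1 AA046309 AI263500 AA046397")
print(row.pkey, row.cat, len(row.accessions))
```

Keeping the CAT number as text rather than as a number preserves the cluster suffix, which an integer conversion would reject.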



Pkey CAT Accessions
407774 101538_1 AA084958 AA214034 AA044587 AA130152 AA130116 D81924
407939 1027688_1 W05608 AW118352 AW196215
407980 103087_1 AA046309 AI263500 AA046397
408224 1048369_1 AW175997 AW176000 AW175999 AW175994 AW176004 AW175989
408241 1048867_1 AW176546 AW178965 BE141057
408268 104987_1 AL138247 AA053529 BE173313
408277 1050131_1 AW177959 AW177960 AW178066 AW178022
408305 1050863_1 BE141991 AW178855 BE141990 BE141993 BE141994 BE062405 BE062404 AW178850
409281 111502_1 AA069998 AA075659 AA075660 AA069828
409408 1128045_1 AW387837 AW387874 AW387826 AW387758 AW387664 AW809268 AW387754
409566 114012_1 AA078899 AA078782 AA075788
409760 115373_1 AA302840 T93016 T92950 AA077551
409794 1154785_1 AW885691 BE185989 BE185940 AW500322 H10282
409810 1155339_1 AW500895 AW937549
409840 1156071_1 AW502122 AW502125 AW501663 AW501720
410071 1174764_1 AW582568 AW818656 AW818647 AW818655 AW818637 AW818234
410128 117773_1 AW904599 AA325920 AW964919 AA081786 T05150
410269 1169983_1 AW613597 BE080235 BE080168
410475 1204788_1 AW749927 AW749938 AW749986 AW749966 R87124 AW860547
410495 1205826_1 N95428 W24040 AW751366 H81987
410520 1206965_1 AW752710 BE180336 BE180186
410534 1207247_1 AW905138 AW753008 R13818 Z43519
410537 1207336_1 AW753108 AW852909 N36993 AW894327 AW903629
410560 1208311_1 N29220 AW754225 AW754224
410562 1208415_1 AW858528 AW754362 H74000
410724 1218202_1 AW799269 AW799385 AW799501 AW799386 AW799270 AW799467
410751 121964_1 AA357918 AA089516 AA357917
410754 1219733_1 T63840 AW601569 AW801568
410785 1221055_1 AW803341 AW803265 AW803403 AW603466 AW803402 AW803413 AW803268 AW803396 AW803334 AW803355
410844 1223878_1 AW807073 AW807055 AW807067 AW807276 AW807030 AW807363 AW845892 AW807091 AW807275 AW807284 AW807287 AW845891 AW807195 AW807271
410910 1226517_1 AW810204 AW810555 AW810196 AW810619 AW810507
410973 1228236_1 AW812278 AW812286 AW812274
410997 1226736_1 AW812877 AW812878 AW812952 AW812880
411110 1232598_1 H93000 AW818022 AW818044
411112 1232692_1 AW818158 AW865743 AW865702 AW865698
411132 1233580_1 AW819191 AW819252 AW819183 AW819175 AW819177 AW819186 AW819180 BE158470 AW819242 AW819269 AW819244 AW819190

411137 1233775_1
411157 1234099_1
411159 1234102_1
411162 1234161_1
411170 1234379_1
411173 123446_1
411193 1235254_1
411242 ! 1
411245 1236412_1
411282 1237660_1
411327 1238874_1
411336 1239216_1
411339 1239387_1
411407 1244071_1
411418 124518_1
411425 1245503_1
411461 1246730_1
411474 1247047_2
411526 1248748_1
       1249443_1
411568 1249776_1
411691 1254166_1
411727 1255052_1
411741 1255976_1
411771 1257335_1
411787 1258789_1
411788 125879_1
411826 1260008_1
411840 1260678_1
411860 126168_1
411880 1263110_1
411932 1266125_1
       1279147_1
412178 1281471_1
412296 1288043_1
412303 1288130_1
412305 1288141_1
412312 1288349_1
412327 1288935_1
412354 1290342_1
412357 1290611_1
412359 129085_1
412367 1291505_1
412389 1292588_1
412547 1305813_1
412559 1306011_1
412671 1319959_1
412744 1324622_1
412838 1331178_1
413119 1349658_1
413141 1350477_1
413266 1356260_1
413294 1359149_1
413342 136279_1
413402 1366932_1
413445 1370833_1
413485 1372953_1
413508 1374273_1
413618 1380384_1
413707 1384137_1
413743 1385884_1
413782 1389068_1
413792 138991_1
413804 1390710_1
414091 141621_1
414203 1425510_2
414220 1426940_1
414276 1432115_-1


AW819265AW819268 AW819246 BE 1 52602 AW81 9249 AW819251 AW819263AW819194 
AW819455 AW819711 AW819553 «o«m>mw 
AW8198S7 AW819955 AW820159 

S!! 78 AW820183 AW819870 AW819937 AW835709 AW820184 AW820157 AW819945 AW820096 
AW819944 AW820182 AW820168 AW819959 AW819953 
AW820503 AW820306 AW820429 BE174741 AW820244 BE174738 
R81571AW962014AA324277 

AW821484 AW821461 AW821490 AW821525 AW821526 AW821519 



AW833441 AW833552 AW833700 AW833610 AW833673 AW833675 

a^S tXXEE AW938265 AW880592 AW852203 AW880588 AW880680 AW995392 AW880598 AW860603 AwS AW8801 75 



AW837675 AW837356 AW837581 AW83734B AW83768 1 
BE164598 AW837578 AW837582 
R00903 R20424AW844169 

BE241B70 AI752584 Z19223 AA307961 AA181165 AA093272 AW368570 

AW846012 AW846007 AW845996 AW845975 

AW847937 AW847672 AW847809 AW847923 AW847B08 AW847616 

AW851186AW996967BE143456 

BE144593 AW851632 AW968149AW968144 W21059 

^P 66018 BE065982 BE056 082 BE065949 BE066019 BE066081 BE065954 BE066084 
A J^ 3 AW858436AW861923AW858551 AW861872AW858474AW861904AW858398 
AW859650 BE071806 AW859655 
AW993247AW861464 
AW863568 BE161696 BE161629 BE161824 
AW897793 R20054 H23321 



T89420N87510AA095932 " " ™^™«™, AW866333AW866328AW866329 

AW872477 BE088101 T05990 

AW898526AW898525219700 

AW936233AW936272 

AW936336AW936339 

AW936369 AW936377 AW936355 AW93641 0 AW936460 

AW939148 AW939200 BE161819 

AW939537 AW939584 W92705 

AW837985AW837938AA101955AW837913AW837935 

SSiSS AWM6M1 AW946044AW946033 AW946024 AW946021 AW946029 AW946015 AW946016 

AW947655AW984020 

W27151 AW961828 

T31474 T31 123 Z451 46 AW962040 

AW977734 H62048 

N31101 N46491 AW994084 

D61870 D61866BE002674 

BE065941 BE066098 BE066070 BE065997 BE066003 
BE166323BE067045 

BE300352 BE299274 BE075351 BE297444 
BE144034N72932BE080176 
AA128535 AW973587 AA553822 

N52628 BE143539 BE143706 

BE145364 BE145429 BE145418 

BE154078 BE154006 BE153891 BE153523 BE153983 

BE158679 BE158714 

BE161004 BE162497 BE161 135 BE160999 BE162492 
BE546104BE165710BE382801 

BE166924 BE166921 BE166925 BE166915 BE166970 BE166968 

T64682 BE168190 BE168256 

T83742AW672882AA135378 

BE262170 BE382553 BE261026 BE273627 

BE298094 BE267860 

BE297862 
414366 1438636_1 BE549143 BE390613 BE277344
414833 1496271_1 T07114 BE543688
414985 1511215_1 C17372 D63180 D78908 D63262
415027 1515978_1 D31010 D30991 D31168 D31166 D31465
415104 1522649_1 D60076 D60259 D61037
415114 1523035_1 DS0468 D60950 D61218 N98936
415178 1527354_1 D80503 D81739 N95691
415296 1533528_1 F05086 F05091 R17158
415303 153360_1 R11813 AA164376 T56632
415352 1534744_1 F06565 Z43466 R18417 F06477 F06476 F07098
415364 1535008_1 F06771 H04895 Z42778
415371 1535066_1 R15239 Z45189 F06836
415392 1535746_1 Z44067 F07617 R34555
415569 1539911_1 Z43930 F12170 H10881 T65261
415572 1539951_1 F12294 F12213 T66272 T66261
415600 1540373_1 F12664 T74312 N80318 Z20039
415626 1540758_1 Z43847 F13068 T75331
415635 1540853_1 F13168 R21289 T77628
415750 155215_1 AA167712 AW936024 AW364433 AW364446
415773 1554447_1 R21651 H00542
415822 155791_1 D59243 D63202 AA169716
415948 156264_1 AA262226 AI984175 AA171807
416173 1574973_1 R52782 R17313 H24192 R19876
416343 1589633_1 H49213 W90725 H49398 W90726
416395 1592777_1 R94575 T99886 H52989
416624 1604694_1 H69044 T47567 H75691 T50292
417304 166556_1 H15635 H16691 AA195506
417396 1674485_1 T98987 T96744 N78306
417742 1696282_1 R64719 Z44680 R12451
417974 171237_1 AA210765 T95700 H94407
418297 1736343_1 R91254 T97156 R58711
418454 175699_1 AA315308 AA223392 BE538098 BE087173
418498 1762961_1 T78248 T88763 R93361
418546 176677_1 AA224827 T59708 T59843 BE156903
418573 176907_1 AA225188 AL157508 AL157509 AA225189
418636 177402_1 AW749855 AA225995 AW750208 AW750206
418712 1784125_1 Z42183 T31621 T97478
418948 180808_1 AJ217097 AW886090 W38035 W38792 AA232835 AW936043
419059 1816038_1 T86216 T86117 T83316
419145 182217_1 N99638 AW973750 AA328271 H90994 AA558020 AA234435 N59599 R94815
419472 185148_1 AW978038 AA804204 AA243400
419477 185172_1 AA826279 AA243426 AW971614 AW967805 AW971605
419716 187390_1 AA953770 AA249091
419805 188248_1 AW966945 AA250984 H91733 AW966941
419807 188252_1 R77402 AA262462 AA250988 R06794
419834 188386_1 AA251139 AW967485 AA251204
419932 189159_1 AA281594 AI365043 AA251996
419998 189768_1 AA252691 AW298356 AA688063
420111 190755_1 AA255652 AA280911 AW967920 AA262684
420270 192170_1 AA257990 AW816460 AI416981 AW500873
420643 195293_1 W87731 AA279004 AA279001
420928 197723_1 AA281809 H89487 N46537
421280 200885_1 AA811804 AA809404 AA286907 AW977624
421338 201378_1 AA287443 AA419385 BE084078 AI478347
421381 201903_1 AA361752 AW963276 AA288017
421418 202288_1 AA806639 AA291008 AA836274 AW978806
421708 205732_1 AW754341 AW858420 AW858475 AW861969 AW861925 AW858554 AW858616 AW858450 AW858545
421813 207654_1 BE048255 AA313083 AA298419
422204 213258_1 AA339015 AW962948 AA306188
422342 215498_1 AA309272 AA309312 AW961837
422429 216469_1 AA310527 AW962295 Z44865 H06641
422504 217160_1 AA311407 AW958321 N23583 R70050
422568 218192_1 AA312730 AW963285 W28250 W27318
422760 221034_1 BE409561 BE162755 AW732798
422834 221906_1 AA318334 AW961457 AA317752
423100 224988_1 AA323114 AA321992 BE161391 BE161392
423121 225175_1 AW864848 AA322213 AA322209 AW961624
423368 227560_1 AA364195 AA325029 AW962050
423632 230430_1 AA328824 AW961410 AW958096
423644 230585_1 AA329048 AW962484 T83628
423818 232362_1 AA332439 AA332579 AA331470 AW982539
423837 232478_1 AW937063 AW937056 AW937062 AW937059 AW937057 AA331599 AW937061 AW937093 AW937094 AW937058 AW851327 BE082872 AW752885 AW937055 AW937054
423912 233241_1 BE091233 BE080904 BE079414 BE075356 AA332482
424353 238398_1 AA339646 AA339189 AW966286
424589 241202_1 AW854298 AA343691 H50917 H50907 H50938
424951 245273_1 AW964082 AA348838 AA348839
424993 245782_1
425360 250631_1
425362 250655_1
425672 254716_1
426356 265381_1
426475 267785_1
426561 269158_1
426603 269825_1
427259 276594_1
427566 280401_1
427980 285225_1
428042 286292_1
428192 288021_1
428337 289967_1
428436 291472_1
429236 301580_1
429917 31049_1
429979 311517_1
430076 312952_1
430350 316401_1
430561 319932_1
430640 321094_1
430795 323756_1
431319 331474_1
431322 331543_1
431670 336353_1
431828 338201_1
432093 341283_1
432222 343347_1
432724 35330_1
433005 357346_1
433108 35898_1
433129 359479_1
433171 360292_1
433311 362908_1
433371 364430_1
433440 36629_-1
433586 370470_1
433782 37414_1
434006 37855_1
434092 379978_1
434138 380572_1
434250 382816_1
435382 405265_1
436023 413562_1
436359 41847_1
436383 41888_1
436740 426095_1
436884 428731_1
437087 432797_1
437146 43371_1
437214 434730_1
437240 435139_1
437251 435242_1
437274 435465_1
437324 43609_1
437471 43747_1
437798 443128_1
437854 44418_1
437938 44573_2
438316 454745_1
438366 456119_1
438390 45662_1
438739 464193_1
438990 46760_1
439183 46956_1
439330 47119_1



P07625 R35115 AW953115 F05102 H13351 AA349497 F11152AA707254 

BE547704AA355909 

AA355936AI741379 

AA361 483 Ai 557239 

BE536836AA376153 

AL134728AA379975AA379440AW963931 
AA381437 AA628833 AW407275 
AA382291 AA994657 
AA400096AA400167 
AI743515 AA405617 AW276706 
AA418305A1264351 
AA419529H97089 H96977 
AA424051 AW104616 
AA644508 AA479489 AA426174 

BE080180 AW827313 AW231970 AA995028 AA428584 AW872716 AW892508 AW854593 AA578441 AW975234 M664937 AA984131 

AA448407 AA448404 AA448424 AA448426 
H80572 AL1 09691 H8052O 
AA463338 M86137 M85945 A1040226 
AA4651 15 AW967750 AW869525 
BE169639AA476976 

BE065227 BE065366 BE065182 BE065184 AA481239 
AA482636 R16336 R16343 R60706 R53647 
AW971398AA507162AA486494 
AA873350 T82429 T82428 

AW970622 AA503009 AA502998 AA502989 AA502805 T921 88 

AW971287AA524976AA513479 

AA572994 AA516249 M702595 

H28383 AW972670 H28359 AA525808 

AI204995 AW827539 AW969908 AW440776 AA528756 

X98266 N41124 

^mmTmS^ BE16 ° 476 AW939938 AW939206 AW940012 AW939076 AA573577 AW750479 AA574383 AW970057 

AA577814 N68543 N68478 H79613 AI910764 

AA579425 AW969965 AA579102 

AA688149 AW975336 R82513 R79237 

T25451 AA585296AA585305 

AF052127 

T85301 AW517087 AA601054 BE073959 
AF090945 AW996754 AI064870 
AF113688 AI114617 
AA625155BE502124 

AA625804 AW418787 AW074833 AI675642 AI393368 

BE005398 AA628622 AA994155 

N54493AA679039N76605 

T81819 AW393709 AA703541 AW370185 

Z83806 AJ132091 AJ132090 

BE065178AJ227879 

AW975133 AA729943 AAB05813 

BE046657 AA736610 AW827495 

AA745563 AA745334 AA744 168 AA744044 AA744034 AA744056 AA745569 AA744397 

^SIJ A ^ 334473 243283 AW875861 AW938044 BE1 50701 AW936262 AA306862 BE565575 BE567380 AA728920 

AA167612AI239729AI251752AA485791 BE568425AW962958 

BE092336 BE092259 BE092497 BE092051 AA746882 A1335378 

AA747537 BE089068 BE089070 

AW976082 AW976085 AA747803 AA7476B0 

AA747965AI248281A1752458 

AL162077AW023677 

AL390169 AW277196 AW131001 

AWB1 1767 AA831890 AW977539 AA758655 

AL1 19723 AL1 19874 AI909O18 U50537 

^SSc^^^SS N368 ° 9 M208m AW967677 N35320 m5Hn H 59397 AW971573 R97276 W01059 AW967671 AA908598 
^ 5 n 1 | 7 A 5 . .A 1820532 W ^891 T85904 U71456 T82391 BE328571 T75102 R34725 AA884922 BE328517 A1219788 AA^4444 N92578 

mcoqcji *> ^^ei ^Irt A ^2898 AI8721 93 AI763273 AW1 73586 AW1 50329 A1653832 Al 762688 AA9$&777 ^^^^^2 A^^^^4 A^M (^^l 3 

2£!^^!K122! AA251669 AA251874 AI819225 AW205852 A1683338 AI858509 AW276905 AI633006AA972584AA908741 AVW72629 
AW513996 AA293273 AA969759 N75628 N22388 HB4729 H60052 T92487 AI022058 AA780419 AA551005 \^70lTwS^37^32 
W^ g F^H834TO181 W78802 R66056 A1002839 R67840 AA300207 AW959581 SfW005 AWB13456A ' 373032 

AA805760BE000856 

AF085890H29949 H29856 
AW970600AA503323 H89218AF086031 H89112 
AF086147W20068 N91464 
439550 473671_1 H10438 AA837479 H16299 AW978825
439780 47673_1 AL109688 R23665 R26578
440012 483290_1 AA861072 H02819 R25946
440635 498968_1 AW610331 AW610338 AA897379
442048 531432_1 AA974603 AI984319 AW340495
442518 544159_1 AF150226 AI240137
444163 593658_1 AI126098 AI184746 AI148521
444282 599268_1 AI138955 BE149059 BE149027
444584 611496_1 AI168422 D80113 T59074
444646 613548_1 AI184565 AL037304 AW793549
445396 638138_1 BE181792 AJ222266 BE181789 BE181837
445832 651925_1 AI261545 N59134 AW875371 AW875247
446119 662531_1 D29527 AI275850
446312 671114_1 BE087853 AJ286184
446922 69865_1 BE175605 Z43529 F06610 BE175602 AV661027
447258 71439_1 BE047911 AA984167 BE617316
447376 719483_-1 AI376747
447522 724922_1 BE143888 AI382949 T06429
448632 77303_-1 BE614269
448643 773566_-1 AI557531
448778 7800_1 AF074913 AW505435 U62539
448871 78566_2 BE616709 BE616712
449213 80240_-1 BE616861
449231 80303_1 BE410360 AA442408 AA315540
449344 805184_1 A5640355 BE464385 AW197679
449666 81249_1 AA002047 N72537 H54142 H81580
449901 818599_1 AI674072 BE268487
450257 830121_1 AW820313 BE065936 BE065948 BE155658 AI689473 AW862275 AW820265 AW820438 AW820331
450573 83907_1 AW964334 AW967230 T08913 R11895 F11463 AA010271
450625 84032_1 AW970107 AA513951 AA010406
450870 84994_1 AA011471 AA700636
450928 851593_1 AI744417 R91614 H77365
451283 86479_1 H83979 R84433 AA017024
451359 86711_1 H85334 AA017286 H86654
451365 867325_-1 AI791783
451373 867533_1 AI792030 AW386116 AW875146
451487 87131_1 AA018072 N46370 R84847
451495 87182_1 H86887 H86898 H86906 AA019519 AA018127 AA056306 AA053726
451535 87353_1 AW970577 AA861861 AA418358 AA018329 AA501885
451724 882130_1 AI903765 AI811194 BE007147 AW130760
451844 888230_1 T61430 AI820546 AI821336
452163 902067_1 AI863140 W80703 R43474
452453 918300_1 AI902519 AI902518 AI902516
452457 918409_1 AW062499 AW062495 AW062500 AW062492 AW062494 AW062493 AW062496 AW062498 AI902650 AI902649
452519 920292_1 BE006701 BE006709 BE006704 AI904981
452530 920646_1 AI905518 AI905516 AI905457 AI905515 AW176013 AW176037
452536 921149_1 BE063380 BE063346 AI906097
452550 92188_1 AA026735 BE264405 AL040323 AA323099
452560 922216_1 BE077084 AW139963 AW863127 AW806209 AW806204 AW806205 AW806206 AW806211 AW806212 AW806207 AW806208 AW806210 AI907497
452738 929698_1 AL133800 AI916209
452825 933090_1 AI921523 AW903707 AW903687
453472 968371_1 AL037925 AL037931 AL037957
453677 977149_1 AL079389 AL079390
453725 978760_1 W28543 AL119531
453846 983043_1 AL157586 AL157590
454044 996486_1 AW022393 AW022578
454049 996858_1 AW022885 AW023064
454099 1007650_1 AW062974 AW859625 BE081366
454314 1108161_1 AW364844 AW364847 AW937534 AW937593 AW937659
454327 1114097_1 BE064097 BE064124 BE064087 AW371617
454331 1115278_1 AW372937 AW378043 AW377970 BE145850 AW377858 AW377964 AW378040
454380 1152648_-1 AW858722
454471 1213795_1 AW902125 AW792853 AW837703 AW837710
454524 1221640_1 AW857191 AW857192 AW804060
454554 122384^1 AW847505 AW811792 BE061442 BE061433 AW847506 AW806999 AW806996 BE061436 BE061430 BE142460 BE146499 AW806994 AW809156 AW606991 AW814082 AW806992 BE061669 AW807002 BE146659 AW806995 AW807000 AW845743 AW845747 AW847504 BE142458 BE061431 BE061435 AW847507 BE146650 BE142470 AW814096 AW807012 BE061438 AW807011 AW806993 BE142465 BE142459 BE142462 AW854330 AW854331 BE061434 BE061731 BE142464 AW847501 AW807001 BE142463 AW811800 BE061437 AW811802 BE061440 AW806997 AW806998 BE061745 BE061753
454592 1226050_1 AW810112 AW810114 AW810032 AW809631 AW810183 AW810178
454622 1226951_1 U70071 AW883715 AW810701
454637 1227707_1 AW811613 AW811611 AW811610
454648 1227976_1 AW811960 AW811957 AW811959
454687 1229077_1 AW814473 AW816157 AW816064 AW813298 AW813315 AW816100 AW813389 AW813338 AW813342 AW816108 AW813318 AW816105 AW816160 AW816163 AW814480 AW813392 AW816063 AW813399
454692 1229118_1 AW813350 AW816082 AW813476 AW813383
454702 
454729 
454750 
454775 
454789 
454790 
454797 
454851 
454863 
454893 



454951 
454955 
455040 
455047 
455073 
455128 
455170 
455201 
455207 
455225 
455235 
455267 
455331 
455350 
455351 
455380 
455414 
455428 

455511 

455573 

455585 

455595 

455610 

455647 

455650 

455657 

455669 

455678 

455761 

455831 

455874 

455880 

455903 

455938 

455950 

455951 

455965 

455981 

456034 

456122 
456212 
456235 
456493 
456592 
456800 
457340 
457427 
457437 
457474 
457625 
457892 



458172 
458781 
458841 
458880 



459128 
459182 



1229722_1
1231926_1
1233434_1
1234106_1
1234742_1
1234752_1
1235093_1
1237414_1
1237794_1
1239472_1
1239833_1
1246612_1
1246711_1
1250028_1
1250536_1
1252361_1
1254554_1
1256906_1
1259748_1
1260400_1
1262318_1
1265634_1
1269025_1
1280616_1
1283853_1
1284023_1
1287679_1
1288605_1
1289639_1

1321229_1
1332529_1
1334857_1
1335422_1
1337549_1
1348563_1
1348720_1
1349078_1
1349554_1
1349716_1
1359566_1
1373969_1
1379514_1
1380022_1
1381603_1
1384537_1
1385884_1
1385886_1
1389984_1
1396805_1
142696_1

153360_1
1655565_1
168686_1
193331_1
202684_1
233842_1
322100_1
336353_1
337879_1
341077_1
373012_1
432926_1
471050_1
497769_1
743159_1
784186_1
80785_1
868710_1
918011_1
922744_1



BE145915 BE145893 AW892892 BE145806 BE145901 AW814144 
AW817003 AW816997 AW817126 AW817331 AW817120 BE158378 AW817121 
AW866285 AW819000 AW819153 AW819014 AW866541 



BE156314 BE156316AW820750 
AW820852 AW820773 AW821088 
BE161168BE162466AWB21260 

AW835127 BE071463 AW991660 BE071473 BE071461 BE071469 BE071472 

AW835610AW835805 

AW837753 AW837754 AW837700 

AW838125AW838205 

AW847464 AW847462 BE063767 BE063755 

AW852530 AVV852527 AW852526 

AW854829 AW854805 AW854841 AW854825 AW854822 AW854830 AW854835 AW854826 
AW861555AW857776AW61556AW861567AVV857799AW857768 



AW994394 AW865900 AW865905 AW865891 AW866014 AW865898 



AW947878 



AW875951 AW875950 AW875936 AW875948 AW875939 AW875957 
AW880861 AW880883AW881117 AW881012 

A W897251 AW897298 AWB97248 AW897259 AW897250 AW897246 MV897242 

s^s^s^SLr^ 1744 ^ 901753 ^ 901807 ^ 1798 

BE160188 AW935785 BE160401 BE160319 BE160313 BE160395 



AW938192 AW938207AW938194 ™ AW938198AW938203AW938202 

BE144762AW979091 
BE004988BE004574 

BE070794 BE070791 BE070792 BE070795 BE070789 8E070788 BE070786 BE070787 BE070790 
BE008343 BE008321 BE008326 BE008314 BE008345 BE008330 BE008344 BE008327 
BE011703 BE011692 BE011697 BE01 1624 BE01 1673 BE011644 
BE0644 15 BE054430 BE064448 
8E064655BE1 53953 
BE065209 BE065364 BE065110 BE065111 
BE065803BEO65826 
BE066007 BE066017 BE066074 
BE080895 BE080906 
BE144966 BE 144957 BE 144958 

BE152283 BE152260 BE152297 BE152292 BE152259 BE152250 
BE153208BE153146BE152981 
BE155185BE155264BE155186 
BE159432 BE159313 BE159295 BE159242 
BE161004 BE162497 8E161135 BE160999 BE162492 
BE161001 BE162494BE162470BE161172 
BE167014BE167058BE167062 
BE177000 BE177003 BE176816 BE176952 

R1 1813 AA1 64376 T56632 
N51636T51874 T51829 
AA203637AA832266 H67452 
AA261830 AW967855 H26953 AA262478 
R91600 T87079AA291455 
AL1 18754 AA333202 H38001 
AA492071 AA484229AW969850 
AW971287 AA524976 AA513479 
AW969732 AW970974 AA515741 
AW972935 AA525272 N2B227 
T10073H14872AA604786 

BE007237BE007496BE007263W19919BE546311 AA984819 
AJ444821 W26012 
W28965W28971 
AA046742AA551269AA001505 



AI902169 AI902173 A1902323 AI902174 AI902347 
BE178517 AJ908132 BE142437 
Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (GI) numbers. "Dunham I. et al." refers to the publication entitled "The DNA sequence of human chromosome 22." Dunham I. et al., Nature (1999) 402:489-495. 
Strand: Indicates DNA strand from which exons were predicted. 
Nt position: Indicates nucleotide positions of predicted exons. 
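
By way of illustration only, an "Nt position" entry can be read as a comma-separated list of start-end pairs, one pair per predicted exon. The following Python sketch is not part of the disclosed methods; the function names `parse_nt_positions` and `total_exon_length` are hypothetical, and the sketch assumes that both coordinates of each pair are inclusive, which the legend does not state.

```python
from typing import List, Tuple


def parse_nt_positions(field: str) -> List[Tuple[int, int]]:
    """Turn '29235-29336,36363-36580' into [(29235, 29336), (36363, 36580)]."""
    exons = []
    for part in field.split(","):
        start_text, end_text = part.strip().split("-")
        start, end = int(start_text), int(end_text)
        if start > end:
            raise ValueError(f"start exceeds end in {part!r}")
        exons.append((start, end))
    return exons


def total_exon_length(exons: List[Tuple[int, int]]) -> int:
    """Sum exon lengths, treating both coordinates as inclusive (an assumption)."""
    return sum(end - start + 1 for start, end in exons)


exons = parse_nt_positions("29235-29336,36363-36580")
print(exons, total_exon_length(exons))
```

An entry whose digits were damaged in reproduction, such as a pair with no hyphen, makes the sketch raise an error rather than yield a guessed coordinate.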



Pkey Ref Strand Nt position
400471 9931670 Minus 105629-105760
400495 9714773 Minus 61902-62137
400528 6981824 Plus 472381-472528,474170-474277,475328-475542,475878-476000
400577 7960401 Plus 101535-101881
400608 9887666 Minus 96756-97558
400641 6117693 Plus 4786-4992
400644 8117693 Plus 27682-27840
400666 8118496 Plus 17982-18115,20297-20456
400749 7331445 Minus 9162-9293
400750 8119067 Plus 198991-199168,199316-199548
400751 7331445 Minus 35395-35533
400761 8131609 Minus 114220-115164
400762 8131616 Plus 7235-7605
400773 8131629 Minus 44116-44238,48208-48321
400843 9188605 Plus 5863-5970,7653-7784,8892-9023,9673-9807,10634-10789,15254-15403,23827-23958
400844 9188605 Plus 24746-24872,25035-25204
400845 9188605 Plus 34428-34612
400846 9188605 Plus 39310-39474
400860 9757499 Minus 151830-152104,152649-152744
400880 9931121 Plus 29235-29336,36363-36580
400887 9958187 Plus 119239-121542
400888 9958187 Minus 199500-199875
400937 7652890 Minus 89519-89905
400977 8072510 Plus 73950-74364
401002 8J17251 Minus 77898-78050
401024 8117489 Plus 60551-60802
401045 8117619 Plus 90044-90184,91111-91345
401048 7232177 Plus 132430-132761
401049 7232177 Plus 149157-150692
401086 9957912 Plus 84561-84884
401093 8516137 Minus 22335-23166
401101 8568122 Plus 77081-77226
401192 9719502 Minus 69559-70101
401197 9719705 Plus 176341-176452
401203 9743387 Minus 172961-173056,173868-173928
401205 9743388 Plus 167373-167433,167936-168031
401256 9796573 Minus 45482-45620
401262 9796963 Plus 35662-35799
401271 9797373 Minus 61292-61911
401276 8954274 Minus 15919-16096
401279 9800062 Minus 13535-13669
401342 9908882 Plus 3096-3242
401365 9796180 Minus 119572-119672
401395 7327842 Minus 11502-11771,46357-46489,58720-58916
401420 7452889 Minus 141362-141502
401439 8246737 Plus 92993-94026
401451 6634068 Minus 119926-121272
401508 7534110 Minus 110779-110983
401519 6649315 Plus 157315-157950
401537 7960358 Minus 186786-187029,190607-190779,198218-198348
401561 8224660 Minus 10652-10838,19815-20018
401566 8469090 Minus 96277-96420,96979-97160
401604 7689963 Minus 119835-120185
401619 6516761 Minus 141309-143576
401669 9801805 Plus 25414-26310
401691 3582311 Plus 162333-162715
401694 3540172 Minus 64056-64168
401747 9789672 Minus 118596-118816,119119-119244,11960^119761,120422-120990,130161-130381,^131932,132451-132575,133580-134011
401759 9929699 Plus 59811-60665
401780 7249190 Minus 28397-28617,28920-29045,29135-29296,29411-29567,29705-29787,30224-30573
401866 6016106 Plus 73126-73623
401905 8671966 Plus 153955-154441,156599-156819
401994 4153858 Minus 42904-43124,43211-43336,44607-44763,45199-45281,46337-46732
402001 9501818 Plus 68052-68223
402075 8117407 Plus 121907-122035,122804-122921,124019-124161,124455-124610,125672-126076
402076 8117410 Plus 128316-128627
402089 7249154 Plus 101610-101819
402110 8131678 Minus 173889-174062
402176 7543887 Minus 10-750
402230 9966312 Minus 29782-23932
402245 7690231 Minus 88253-88417
402296 6598824 Plus 22587-23723
402325 7636348 Minus 6065M0738,61577-61803
402407 3962498 Minus 115812-116187
402408 9796239 Minus 110326-110491
402430 9796372 Minus 62382-62552
402435 9795462 Plus 114593-115588
402472 9797116 Plus 53716-54470
402474 7547175 Minus 53526-53628,55755-55923,57530-57757
402480 9797375 Plus 59708-59999
402490 9797648 Plus 149982-150929
402522 9798493 Plus 20605-2)731
402530 7630937 Minus 1524-2003
402546 7637348 Plus 24673-25170
402553 9863566 Plus 48292-48398,49564-49944
402604 9909420 Plus 20393-20767
402716 8969253 Minus 84065-84242
402727 9211324 Plus 54596-54777
402812 6010110 Plus 25026-25091,25844-25920
402820 6456853 Minus 82274-82443
402846 9408716 Minus 5726-5850
402889 9931133 Plus 89392-89498,90358-90571
402892 8086844 Minus 194384-194645
402901 8894222 Minus 175426-175667
402922 8216969 Minus 19036-19401,19589-19849,19951-20102
402938 8953442 Plus 22365-22473
402995 2996643 Minus 5962-6216
403005 5791501 Minus 16945-17053,20018-20403
403020 6984114 Minus 96644-97021,97462-97868
403029 7768593 Minus 44558-44766
403047 3540153 Minus 59793-59968
403073 8954241 Plus 142964-143260
403085 8954241 Plus 165035-165334,165420-165713
403092 8954241 Plus 174720-175016,175104-175406,175508-175813
403106 7331404 Plus 77162-77350,81338-81511
403172 7464784 Minus 64007-64275
403212 7630897 Minus 156037-156210
403214 7630945 Minus 76723-77027,79317-79484
403277 8072597 Minus 27494-27642
403331 8567936 Plus 169793-169966
403344 8569726 Plus 70823-70990
403362 8571772 Plus 64099-64260
403381 9438267 Minus 26009-26178
403426 9719529 Minus 157156-158183
403485 9966528 Plus 2888-3001 ,31 9W532.365541 17
403567 8101141 Plus 35349-35614
403588 8101227 Minus 197672-197944
403590 8101229 Plus 405-1296
403615 8567964 Plus 107671-107866
403687 7387384 Plus 9009-9534
403754 7229815 Minus 163899-164726
403776 7770611 Minus 1414-1513,1624-1756
403822 9369510 Minus 142803-142922
403851 7708872 Plus 22733-23007
403860 7708960 Minus 95755-96045
403894 7381715 Minus 1442-2224
403903 7710671 Minus 101165-102597
403959 8224399 Minus 175363-177474
404015 8655948 Minus 587821-588222
404059 3548785 Plus 104326-106788
404113 9588571 Minus 13446-13646
404148 9863703 Plus 78218-78418,79571-79709
404152 9884757 Plus 41111-41281,45495-45716,47801-47910
404156 9886577 Plus 127319-127754
404229 7159766 Plus 16607-16841
404232 8218045 Minus 71800-71956
404268 9711362 Plus 33238-33463
404274 9885189 Plus 104127-104318
404288 2769644 Plus 3512-3691
404290 2769644 Plus 36651-36813
404336 9838028 Plus 157951-158129
404403 7272157 Minus 72053-72238
404440 7528051 Plus 80430-61 581
404488 8113286 Minus 64835-64994
404498 8151654 Plus 13292-13497
404507 8151803 Plus 146359-146739
404516 8151967 Plus 114153-114322
404538 6247909 Minus 192748-192945
404594 9958262 Minus 15310-15510
404639 9795778 Plus 5779-14387
404653 9796999 Plus 164997-165230
404676 9797204 Minus 56167-56342,58065-58169,58891-59048,60452-60628
404684 9797403 Minus 110881-111020
404685 9797437 Minus 153217-153315,154043-154124,159185-159353,161290-161420,163544-163669,166127-166207,167654-167734
404704 9800728 Minus 88841-89018
404819 4678240 Plus 16223-16319,16427-16513,16736-16859,16941-17075,17170-17287,17383-17529,18261-18357,18443-18576
404829 6624702 Minus 4913-5093,7310-7469,9472-9621,9951-10082
404860 8979555 Plus 65852-66081
404874 9650523 Minus 96066-96192
404881 5931510 Minus 36360-36608
404894 6850447 Plus 102822-103127
404939 6862697 Plus 175318-175476
404977 3738341 Minus 43081-43229
405033 7107731 Minus 142358-142546
405059 7656683 Plus 349422
405064 7658416 Plus 81207-81416
405071 7708797 Minus 11115-11552
405102 8076881 Minus 120922-121296
405167 9966316 Plus 43796-43981,48245-48427,54141-54317
405170 9966524 Plus 37047-37198
405177 7139696 Minus 118466-118663
405186 7229793 Plus 161475-161581,162930-163067
405258 7329310 Plus 129930-130076
405281 6139075 Minus 34202-34351,35194-3533M5412-45475.45731-4595B^ 53667,54111-54279
405308 3638954 Plus 40778-41034,41383-41573
405349 2914717 Minus 85552-85806
405379 6513908 Minus 22332-22473,24333-24439
405390 6606064 Minus 94007-94177
405411 3451356 Minus 17503-17778,18021-18290
405463 7715630 Minus 123097-123260
405494 8050952 Minus 70284-70518
405520 9454643 Plus 60849-60981
405526 9558556 Minus 132704-133277
405580 4512267 Plus 169232-169647
405600 5923640 Plus 26662-27225
405654 4895155 Minus 53624-53759
405720 9797144 Plus 13409-13861
405725 9838299 Minus 106417-106521
405735 9931101 Minus 29854-29976
405738 9943998 Plus 44370-45410
405809 5304920 Minus 66554883,8687-8859
405838 5686575 Plus 3460-3717
405863 7657810 Plus 49410-49620
405867 6758731 Minus 74553-75173
405906 7705124 Minus 10835-11059
405920 6758795 Plus 120621-120971
405968 8247789 Plus 14893-15148
406017 8272661 Minus 46271-46874
406036 6758919 Plus 17942-18163


406081 


9123861 


Minus 


38115-38691 


406137 


9166422 


Minus 


30487-31058 


406187 


7289992 


Plus 


8044-8877 


406243 


7417725 


Pius 


38899-39369 


406270 


7534217 


Plus 


13136-13591 


406320 


9211754 


Minus 


20170-20511 


406322 


9212102 


Minus 


130230-130418 


406360 


9256107 


Minus 


7513-7673 


406367 


9256126 


Minus 


58313-58489 


406397 


9256243 


Minus 


127317-127454 


406399 


9256288 


Minus 


63448-63554 


406434 


9256651 


Minus 


17803-17931 


406457 


9795551 


Rus 


182212-182958 


406471 


9795566 


Plus 


87383-87589 


406475 


9797684 


Plus 


125417-125563,126052-128180 


406485 


7711305 


Plus 


125036-125422 


406511 


7711412 


Plus 


177277-177384 


406588 


8189273 


Minus 


135629-135848 
TABLE 2A: GENES UP-REGULATED IN BLADDER CANCER 

Pkey: Unique Eos probeset identifier number 

ExAccn: Exemplar Accession number, Genbank accession number 

UnigeneID: Unigene number 

Unigene Title: Unigene gene title 

R1: Ratio of mRNA expression in bladder tumors compared to normal bladder 



Pkey ExAccn UnigeneID Unigene Title R1 

418818 AA228899 Hs.101307 Homo sapiens HUT11 protein mRNA, partial 3.473 

412841 AI751157 Hs.101395 hypothetical protein MGC11352 2.279 

421066 AU076725 Hs.101408 branched chain aminotransferase 2, mitoc 3.052 

435136 R27299 Hs.10172 ESTs 4.717 

444042 NM_004915 Hs.10237 ATP-binding cassette, sub-family G (WHIT 0.003 

421307 BE539976 Hs.103305 Homo sapiens mRNA; cDNA DKFZp434B0425 (f 188.231 

421318 U63973 Hs.103501 rhodopsin kinase 1.381 

421359 AK001589 Hs.103816 hypothetical protein FLJ10727 1.000 

459462 AA481396 Hs.105167 ESTs 1.000 

430134 BE380149 Hs.105223 ESTs, Weakly similar to T33188 hypotheti 1.000 

433227 AB040923 Hs.106808 kelch (Drosophila)-like 1 1.000 

421742 AW970004 Hs.107528 androgen induced protein 1.514 

417366 BE185289 Hs.1076 small proline-rich protein 1B (cornifin) 2.782 

444342 NM_014398 Hs.10887 similar to lysosome-associated membrane 20.064 

451686 AA059246 Hs.110293 ESTs 0.033 

454417 AI244459 Hs.110826 trinucleotide repeat containing 9 56.751 

458760 AI498631 Hs.111334 ferritin, light polypeptide 2.512 

422119 AI277829 Hs.111862 KIAA0590 gene product 2.634 

422170 AI791949 Hs.112432 anti-Mullerian hormone 0.055 

441877 AW273802 Hs.11340 hypothetical protein FLJ23047 0.008 

445958 BE326257 Hs.114536 ESTs 0.002 

434288 AW189075 Hs.116265 fibrillin 3 11.401 

435347 AW014873 Hs.116963 ESTs 0.003 

453134 AA032211 Hs.118493 ESTs 262.962 

444781 NM_014400 Hs.11950 GPI-anchored metastasis-associated prote 5.336 

436154 AA764950 Hs.119898 ESTs 103.154 

436246 AW450963 Hs.119991 ESTs 0.071 

436293 AI601188 Hs.120910 ESTs 29.129 

433078 AW015188 Hs.121575 Homo sapiens cDNA FLJ12231 fis, clone MA 274.769 

438181 AW978605 Hs.122121 ESTs, Weakly similar to I38022 hypotheti 0.024 

449399 AA760881 Hs.122408 ESTs 1.000 

437722 AW292947 Hs.122872 ESTs, Weakly similar to JU0033 hypotheti 4.314 

457465 AW301344 Hs.122908 DNA replication factor 0.264 

409757 NM_001898 Hs.123114 cystatin SN 1.390 

439907 AA853978 Hs.124577 ESTs 0.010 

437181 AI306615 Hs.125343 ESTs, Weakly similar to KIAA0758 protein 0.344 

440304 BE159984 Hs.125395 ESTs 0.025 

423248 AA380177 Hs.125845 ribulose-5-phosphate-3-epimerase 0.014 

441495 AW294603 Hs.127039 ESTs 0.198 

435376 AW770956 Hs.127280 ESTs 0.008 

427685 AI751124 Hs.127311 ESTs 3.244 

423349 AF010258 Hs.127428 homeobox A9 0.134 

445457 AF168793 Hs.12743 carnitine O-octanoyltransferase 7.255 

420759 T11832 Hs.127797 Homo sapiens cDNA FLJ11381 fis, clone HE 1.000 

441875 AI435973 Hs.128056 ESTs 0.013 

441940 AW298115 Hs.128152 ESTs 6.075 

445537 AJ245671 Hs.12844 EGF-like-domain, multiple 6 0.137 

429983 W92620 Hs.128656 ESTs 162.590 

445600 AF034803 Hs.12953 PTPRF interacting protein, binding prote 0.969 

437553 AI829935 Hs.130497 ESTs, Weakly similar to MAT8_HUMAN CHLOR 9.163 

459204 AW194601 Hs.13219 ESTs 1.000 

439842 AI910896 Hs.132413 ESTs 1.000 

443113 AI040686 Hs.132908 ESTs 0.069 

423853 AB011537 Hs.133465 slit (Drosophila) homolog 1 0.100 

420792 AA280321 Hs.13392 tethering factor SEC34 16.103 

427719 AI393122 Hs.134726 ESTs 0.667 

443861 AW449462 Hs.134743 ESTs 5.100 

447578 AA912347 Hs.136585 ESTs, Weakly similar to JC5314 CDC28/cdc 1.691 

445550 AI242754 Hs.137306 ESTs 0.006 

454284 AW297935 Hs.138493 ESTs, Moderately similar to ALU7_HUMAN A 0.003 

418937 T71508 Hs.13861 ESTs, Weakly similar to T42383 probable 0.042 

424098 AF077374 Hs.139322 small proline-rich protein 3 1.347 

453370 AI470523 Hs.139336 ATP-binding cassette, sub-family C (CFTR 0.186 

424099 AF071202 Hs.139336 ATP-binding cassette, sub-family C (CFTR 47.949 

426900 AW163564 Hs.142375 ESTs 0.404 

439337 AA448718 Hs.142505 ESTs 0.012 
427961 AW293165 Hs.143134 ESTs 0.073 

419888 AI243493 Hs.144049 ESTs 11.958 

413943 AW294416 Hs.144687 Homo sapiens cDNA FLJ12981 fis, clone NT 77.269 

445871 AI702901 Hs.145582 ESTs, Weakly similar to FOR4_MOUSE FORMI 183.782 

445911 AI985987 Hs.145645 ESTs, Moderately similar to ALU1_HUMAN A 0.362 

424395 AA165082 Hs.146388 microtubule-associated protein 7 203.038 

424411 NM_005209 Hs.146549 crystallin, beta A2 1.808 

444517 AI939339 Hs.146883 ESTs 0.004 

445020 AI205655 Hs.147221 ESTs 0.307 

422109 S73265 Hs.1473 gastrin-releasing peptide 1.000 

445352 AI221087 Hs.147761 ESTs 0.015 

444444 AI149332 Hs.14855 ESTs 140.859 

444152 AI125694 Hs.149305 hypothetical protein MGC2603 2.037 

446248 AI283014 Hs.149638 ESTs 0.018 

433159 AB035898 Hs.150587 kinesin-like protein 2 0.110 

428004 AA449563 Hs.151393 glutamate-cysteine ligase, catalytic sub 3.179 

456840 H03754 Hs.152213 wingless-type MMTV integration site fami 0.005 

456844 AI264155 Hs.152981 CDP-diacylglycerol synthase (phosphatida 1.111 

425206 NM_002153 Hs.155109 hydroxysteroid (17-beta) dehydrogenase 2 257.949 

446082 AI274139 Hs.156452 ESTs 0.779 

444946 AW139205 Hs.156457 hypothetical protein FLJ22408 1.919 

446636 AC002563 Hs.15767 citron (rho-interacting, serine/threonin 0.417 

447073 AW204821 Hs.157726 ESTs 10.349 

422765 AW409701 Hs.1578 baculoviral IAP repeat-containing 5 (sur 1.839 

446673 NM_016361 Hs.15871 LPAP for lysophosphatidic acid phosphata 1.691 

447475 AI380797 Hs.158992 ESTs 44.641 

425776 U25128 Hs.159499 parathyroid hormone receptor 2 0.340 

418343 AA216372 Hs.159501 ESTs 0.023 

441143 AI027604 Hs.159650 ESTs 0.280 

440917 AA909651 Hs.160025 ESTs 1.000 

418365 AW014345 Hs.161690 ESTs 0.066 

431839 AW020280 Hs.162025 ESTs 0.005 

446839 BE091926 Hs.16244 mitotic spindle coiled-coil related prot 0.606 

438817 AI023799 Hs.163242 ESTs 2.202 

432441 AW292425 Hs.163484 ESTs 2.305 

442577 AA292998 Hs.163900 ESTs 688.038 

435212 AW300100 Hs.164185 ESTs 0.002 

425048 H05468 Hs.164502 ESTs 0.083 

442083 R50192 Hs.165062 ESTs 3.844 

423536 L22075 Hs.1666 guanine nucleotide binding protein (G pr 0.157 

418678 NM_001327 Hs.167379 cancer/testis antigen 269.487 

446989 AK001898 Hs.16740 hypothetical protein FLJ11036 0.208 

456967 AW004056 Hs.168357 T-box 2 160.397 

447979 AI457197 Hs.170348 ESTs 0.016 

458814 AI498957 Hs.170861 ESTs, Weakly similar to Z195_HUMAN ZINC 1.036 

446312 BE087853 Hs.171802 ESTs, Weakly similar to T08729 RING zinc 1.334 

426783 Z19084 Hs.172210 MUF1 protein 1.654 

423916 AW993496 Hs.17235 Homo sapiens clone TCCOA00176 mRNA sequ 154.064 

409092 AI735283 Hs.172608 ESTs 0.007 

426853 U32974 Hs.172777 baculoviral IAP repeat-containing 4 0.009 

426968 U07616 Hs.173034 amphiphysin (Stiff-Mann syndrome with br 0.002 

407581 R48402 Hs.173508 P3ECSL 0.866 

427239 BE270447 Hs.174070 ubiquitin carrier protein 15.708 

427268 X78520 Hs.174139 chloride channel 3 207.936 

436577 W84774 Hs.17643 ESTs 62.333 

420876 AA918425 Hs.177744 ESTs 32.959 

427528 AU077143 Hs.179565 minichromosome maintenance deficient (S. 1.171 

427585 D31152 Hs.179729 collagen, type X, alpha 1 (Schmid metaph 1.000 

427747 AW411425 Hs.180655 serine/threonine kinase 12 12.446 

429813 AW139678 Hs.180791 ESTs 0.013 

439806 AA846824 Hs.180908 ESTs 0.561 

427878 C05766 Hs.181022 CGI-07 protein 0.002 

440284 AA912032 Hs.181059 ESTs, Weakly similar to 2108276A ssDNA-b 0.030 

427922 AK001934 Hs.181112 HSPC126 protein 0.039 

427972 AA864870 Hs.181304 putative gene product 0.004 

428071 AF212848 Hs.182339 ets homologous factor 4.321 

428336 AA503115 Hs.183752 microseminoprotein, beta- 145.128 

428450 NM_014791 Hs.184339 KIAA0175 gene product 0.370 

428479 Y00272 Hs.184572 cell division cycle 2, G1 to S and G2 to 0.632 

438746 AI885815 Hs.184727 ESTs 0.339 

420557 AA960844 Hs.186579 Homo sapiens, clone IMAGE:4081483, mRNA 0.006 

431014 W67730 Hs.187573 ESTs 0.344 

428651 AF196478 Hs.188401 annexin A10 1.459 

416225 AA577730 Hs.188684 ESTs, Weakly similar to PC4259 ferritin 0.502 

432497 AA551104 Hs.189048 ESTs, Moderately similar to ALUC_HUMAN ! 2.499 

431474 AL133990 Hs.190642 ESTs 0.044 

427742 AA411880 Hs.190888 ESTs 0.158 
428058 AI821625 Hs.191502 

431245 AA496933 Hs.191687 

453204 R10799 Hs.191990 

436608 AA628980 Hs.192371 

447342 AI199268 Hs.19322 

454032 W31790 Hs.194293 

449121 AI915858 Hs.194980 

447827 U73727 Hs.19718 

438401 AL045321 Hs.197484 

457200 U33749 Hs.197764 
Hs.198249 

429257 Hs.198365 

429276 AF056085 Hs.198612 

449818 AW594365 Hs.199365 

429345 R11141 Hs.199695 

443564 AI921685 Hs.199713 

449847 AW204447 Hs.199750 

449351 AW016537 Hs.200760 

426322 J05068 Hs.2012 

434411 Hs.201372 

448045 

446555 AV659046 

442282 

427587 BE348244 

429486 AF155827 

426682 

426746 Hs.2057 


448275 BE514434 Hs.20830 

459058 H85939 Hs.209605 

441795 N58115 Hs.21137 

451592 AI805416 Hs.213897 

443367 AW071349 Hs.215937 

415949 H10562 Hs.21691 

444008 BE544855 Hs.220756 

432548 AW973399 Hs.22133 

427867 NM_005073 Hs.2217 

453123 AI953718 Hs.221849 

439569 AW602166 Hs.222399 

428227 AA321649 Hs.2248 

435956 AF269255 Hs.22604 

453883 AI638516 Hs.22630 

431253 R06428 Hs.226351 

453900 AW003582 Hs.226414 

430034 X60155 Hs.227767 

423017 AW178761 Hs.227948 

417997 AA418189 Hs.23017 

441362 BE614410 Hs.23044 

452956 AW003578 Hs.231872 

446009 AI989885 Hs.231926 

430499 AW969408 Hs.231991 

448560 BE613183 Hs.23213 

441508 AW015203 Hs.232237 

453228 AW628325 Hs.232327 

442167 H18740 Hs.23248 

453321 AI984381 Hs.232521 

449207 AL044222 Hs.23255 

430152 AB001325 Hs.234642 

439239 AKJ31540 Hs.235331 

435087 AW975241 Hs.23567 

451276 AW294386 Hs.236533 

447343 AA256641 Hs.236894 

431011 AA490831 Hs.23783 

430307 BE513442 Hs.238944 

444371 BE540274 Hs.239 

424264 080400 Hs.239388 

449722 BE280074 Hs.23960 

430486 BE062109 Hs.241551 

430168 AW968343 Hs.24255 

452292 AW139588 Hs.244369 

412661 N32860 Hs.24611 

456682 AW500321 Hs.246766 

457343 NM_013936 Hs.247862 

430978 U53583 Hs.248182 

431020 AF097874 

431070 AW408164 Hs.249184 

431098 AW501465 Hs.249230 


ESTs 0.006 
ESTs 0.006 
ESTs 1.734 
down syndrome critical region protein OS 115.500 

Homo sapiens, Similar to RIKEN cDNA 2010 0.152 
ESTs, Weakly similar to I54374 gene NF2 60.103 
ESTs 0.003 
protein tyrosine phosphatase, receptor t 305.974 
ESTs, Weakly similar to JC4296 ring fing 0.002 
thyroid transcription factor 1 0.011 
gap junction protein, beta 5 (connexin 3 9.390 
2,3-bisphosphoglycerate mutase 178.436 
G protein-coupled receptor 51 0.729 
ESTs 1.000 
hypothetical protein 7.339 
ESTs 0.001 
organic anion transporter polypeptide-re 1.000 

ESTs 0.005 
transcobalamin I (vitamin B12 binding pr 381.474 

ESTs 0.039 
prostate stem cell antigen 2.337 
ESTs 0.024 
ESTs 0.004 
ESTs 1.000 
ESTs, Weakly similar to I78885 serine/th 228.705 
hypothetical protein FLJ10339 0.133 
UDP glycosyltransferase 1 family, polype 2.070 

uridine monophosphate synthetase (orotat 0.528 
kinesin-like 2 19.718 

EST 0.005 
AD024 protein 1.000 

ESTs 0.012 

ESTs 0.003 

ESTs 0.072 

ESTs, Weakly similar to SFR4_HUMAN SPLIC 213.962 

hypothetical protein FLJ20121 0.250 

solute carrier family 15 (oligopeptide t 0.010 

ESTs 0.566 

CEGP1 protein 10.625 

small inducible cytokine subfamily B (Cy 27.603 

lysosomal apyrase-like protein 1 127.564 

cofactor required for Sp1 transcriptiona 2.216 

ESTs 0.023 

ESTs, Weakly similar to ALU8_HUMAN ALU S 0.103 

zinc finger protein 41 1.000 

serine (or cysteine) proteinase inhibito 0.024 

Homo sapiens cDNA: FLJ22747 fis, clone K 1.635 

RAD51 (S. cerevisiae) homolog (E coli Re 120.167 

ESTs 1.000 

ESTs 4.000 

ESTs 0.014 

ESTs 285.090 

ESTs 0.261 

ESTs 1.000 

hypothetical protein from EUROIMAGE 2005 0.240 

ESTs 0.609 
nucleoporin 155kD 0.551 
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gb:UI-H-BI3-a)a-a-12-<UJI.s1 NCI_CGAP_Su 


0.807 


409488 AW402825 gb:UI-HF-BK0-aacrd-08^Ul.r1 NIHJAGCJ6 0.965 

437938 AI950087 gb:wq05c02.x1 NCI_CGAP_Kid12 Homo sapien 2.952 

451385 AA017656 gb:ze39h01.r1 Soares retina N2b4HR Homo 7.341 

449325 AA001162 gb:ze48b06.r1 Soares retina N2b4HR Homo 0.004 

413316 W91931 gb:zh47c01.r1 Soares_fetal_liver_spleen_ 0.004 
401016 0.342 
401335 0.256 
401555 1.000 
401760 301.372 
401781 247.141 
401961 1.722 
402239 5.180 
402305 0.917 
402424 551.141 
402777 153.231 
402778 0.006 
402837 0.367 
402948 154.103 
402952 17.038 
403142 0.196 
403297 12.744 
403637 0.304 
403657 0.032 
404136 0.008 
404249 0.065 
404875 1.105 
404917 69.590 
404983 1.000 
405238 1.000 
405364 294.141 
405531 1.747 
405601 145.551 
405621 0.224 
405932 1.968 
406117 0.333 
406354 1.000 
406548 0.002 
406599 0.010 
459702 AI204995 0.449 

TABLE 2B 



Pkey: Unique Eos probeset identifier number 
CAT number: Gene cluster number 
Accession: Genbank accession numbers 



409488 
412799 
412964 
413100 
413158 
413316 
413671 
414221 

414969 
415204 
421934 
424040 
428436 

431322 
432189 
433005 
433106 
434414 



434572 
436383 
437240 
437938 



CAT Number 
1134791J 
132817 1 
1339278 1 
13491 19 1 
1351251 1 
1360169 1 
1382504 1 
142696.1 

151Q393J 
1529407 1 
209339J 
234659 1 
291472.1 

331543 1 
342819 1 
357346 1 
J 
1 



38911 J 
41888 1 
435139J 
44573J2 



Accession 

AW402825 BE544338 

AI267606AA121045AA126521 

BE019688BE144460 

BE065208 BE065224 BE065168 BE065313 

BE068098 BE068119 BE068083 BE068088 BE068120 BE068155 BE06811 1 
W91931 W94979 BE081744 

Z43712I BE156729 BE156538 BE156731 BE156673 BE156539 BE156674 BE156430 BE156672 BE156675 BE156432 BE156541 

AW450979 AA136653 AA136656 AW419381 AA984358 AA492073 BE168945 AA809054 AW238038 BE01 1212 BE01 1359 BE01 1367 

BE01 1368 8E01 1362 BE011215 BE01 1365 BE01 1363 

C16195 C16230 C16211 C16164 C16251 

T27434 Z25288 F00323 D82802 R85077 

AA300625 R16859 R16860 AW898335 W24337 

AA334400 AA334257 AW966124 

BE080180 AW827313 AW231970 AA995028 AA428584 AW87271 6 AW892508 AW854593 AA578441 AW975234 AA664937 M984131 AA528743 
AA527941 AI810608AI620190 AA635265 

^02^ 4 TraU6° 73 BE16<M76 AW939938 AW93 9206 AW940012 AW939076 AA573577 AW750479 AA574383 AW970057 

MS™f1^?^n£! AW11616VW0557 BE142245 AW858232AW861851 AW858362 AA232351 AA218567AA055556 AW858231 
AWB57541 AW814172H66214 AWB14398 AF134164 AA243093 AA 173345 AA199942 AA223384 AA227092 AA227O80 T 12379 AA092174 
I 6 .! H^ll 9 ^^ 699829 AW879188 AW813567 AW813538 AI267168 AA 1577 18 AA157719 AA100472 AA100774 AA130756 AA157705 

AA206583 AA209204 BE1 56909 AA226824 

AF147340T51948T52029 
BE065178AJ227879 
AA747537 BE089068 BE089070 

4io?ncni AW967677 N35320 A1251473 H59397 AW971573 R97278 W010S9 AW967671 M908598 AA251875 

A1KOM1AI820532W87891 T85904 U714S6T82391 BE328571 T75102 R34725AA884922BE328517 AK19788 AA884444N92S78F13493 

SS^^'^^ 5 *™""*"'^™ AI950344AI741M?AI6890ffl 
^SI! ^^98*8721 93 AI7»273A^ 

A 5^^,^ W ? )K12AI961530AW629970 866,2881 AW276997AW513601 AW512843 AA044209 AW856538 AA1 80009 AA337499 
AW961101 AA251669AA251874AIE19225AW205862AI683338A1858509 AW276905 A1633006 AA972584 AA908^W07^AVW13996 
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AA293273 AA959759 N75628 N22388 H84729 H60052 T92487 A1022058 AA780419 AA551005 W80701 AW613456 AI373032 A1564269 F00531 
H83488 W37181 W78802 R66056 AI002839 R67840 AA300207 AW959581 T63226 F04005 



438990 46760_1 AF085890 H29949 H29856 
439780 47673_1 AL109688 R23665 R26578 
440012 483290_1 AA861072 H02819 R25946 
443309 56604_1 AI821874 AI821868 AA630932 AA653897 AA650103 AI821131 AI821124 
444163 593658_1 AI126098 AI184746 AI148521 
449325 80480_1 AA001162 AA018950 AA017505 R84446 
451385 86787_1 AA017656 AA017374 AA019761 
453823 982526_1 AL137967 BE064160 BE064186 
453846 983043_1 AL157586 AL157590 
454789 1234742_1 BE156314 BE156316 AW820750 
455170 1256906_1 AW860972 AW862598 AW862599 AW860988 AW860983 AW860898 AW860925 AW860922 AW860986 AW860984 AW860989 
455189 1259271_1 AW864176 AW864133 AW864185 AW864137 
455380 1287679_1 BE160188 AW935785 BE160401 BE160319 BE160313 BE160395 
455650 1348720_1 BE064655 BE153953 
455797 1366826_1 BE091833 BE091874 BE091871 
455807 1370914_1 BE141140 BE141139 BE141105 BE141143 BE141127 BE141202 BE141108 
455831 1373969_1 BE144966 BE144957 BE144958 
458829 773443_1 AI557388 BE158936 
459169 920641_1 AI905517 AI905455 AI905452 



TABLE 2C 

Pkey: Unique number corresponding to an Eos probeset 

Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (GI) numbers. "Dunham I. et al." refers to the publication entitled "The DNA sequence of human chromosome 22." Dunham I. et al., Nature (1999) 402:489-495. 

Strand: Indicates DNA strand from which exons were predicted. 

Nt_position: Indicates nucleotide positions of predicted exons. 
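
By way of illustration only, the exon coordinates listed in the last column of this table can be read programmatically. The sketch below is not part of the disclosed method; the function names are hypothetical, and it assumes that each entry is a comma-separated list of start-end pairs with inclusive, 1-based coordinates on the referenced Genbank sequence.

```python
from typing import List, Tuple


def parse_nt_position(field: str) -> List[Tuple[int, int]]:
    """Split a 'start-end,start-end,...' string into (start, end) pairs."""
    exons = []
    for part in field.split(","):
        start_text, end_text = part.strip().split("-")
        start, end = int(start_text), int(end_text)
        if start > end:
            raise ValueError(f"start exceeds end in {part!r}")
        exons.append((start, end))
    return exons


def total_exon_length(exons: List[Tuple[int, int]]) -> int:
    """Sum exon lengths, treating both coordinates as inclusive (an assumption)."""
    return sum(end - start + 1 for start, end in exons)


# Entry taken from the row for Pkey 401760 in this table.
exons = parse_nt_position("83126-83250,85320-85540,94719-95287")
print(exons, total_exon_length(exons))
```

Entries whose coordinates are damaged in the source text would raise an error here rather than be silently misread, which is the intended behavior for a sketch of this kind.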



Pkey Ref Strand Nt_position 
401016 8117441 Plus 126234-126359,128050-128236 
401335 9884881 Plus 15736-16352 
401555 8099284 Minus 162520-162657 
401760 9929699 Plus 83126-83250,85320-85540,94719-95287 
401781 7249190 Minus 83215-83435,83531-83656,83740-83901,84237-84393,84955-85037,86290-86814 
401961 4581193 Minus 124054-124209 
402239 7690131 Plus 38175-38304,42133-42266 
402305 7328724 Plus 40832-41362 
402424 9796344 Minus 64925-65073 
402777 9588235 Plus 126786-126948 
402778 9588235 Plus 128560-128702 
402837 9369121 Minus 2013-2186,9570-9758,11136-11309,19429-19677,21210-21455,23368-23562,24342-24627,29132-29320 
402948 9368458 Minus 143456-143626,143808-143935 
402952 9408724 Minus 119452-119619 
403142 9444521 Plus 89286-90131 
403297 8096824 Minus 16584-17264 
403637 8671936 Minus 142647-142771,145531-145762 
403657 8843996 Minus 156223-156370 
404136 6981900 Minus 42538-46428 
404249 8655533 Plus 64270-64633 
404875 9801324 Plus 96588-96732,97722-97831 
404917 7341851 Plus 49330-49498 
404983 4432779 Minus 51178-51374,52000-52173 
405238 7249119 Minus 51728-51836 
405364 2281075 Minus 48325-48491,49136-49252 
405531 9665194 Plus 35602-35803 
405601 5815493 Minus 147835-147935,149220-149299 
405621 5523811 Plus 59362-59607 
405932 7767812 Minus 123525-123713 
406117 9142932 Plus 54304-54584 
406354 9256049 Minus 2095-2377 
406548 7711514 Minus 25138-26762 
406599 8248616 Plus 10933-11086 
Table 3A: Preferred therapeutic targets for bladder cancer 

Pkey: Unique Eos probeset identifier number 

ExAccn: Exemplar Accession number, Genbank accession number 

UnigeneID: Unigene number 

Unigene Title: Unigene gene title 

R1: 90th percentile of bladder tumor AIs divided by the 30th percentile of normal bladder sample AIs 

R2: 90th percentile of bladder tumor AIs divided by the 90th percentile of normal body sample AIs 
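
For illustration, a ratio of the kind defined for R1 and R2 can be sketched as follows. This is a minimal sketch under stated assumptions and not the procedure actually used to generate the table: the function names are hypothetical, "AIs" are taken to be per-sample expression intensity values, percentiles are computed by linear interpolation, and the percentile used for each group is left as a parameter.

```python
from statistics import quantiles
from typing import Sequence


def percentile(values: Sequence[float], pct: int) -> float:
    """Return the pct-th percentile (1-99) using linear interpolation."""
    # quantiles(..., n=100) returns the 99 cut points between percentiles.
    cuts = quantiles(values, n=100, method="inclusive")
    return cuts[pct - 1]


def percentile_ratio(tumor: Sequence[float], normal: Sequence[float],
                     tumor_pct: int = 90, normal_pct: int = 90) -> float:
    """Divide a tumor percentile by a normal-sample percentile."""
    denominator = percentile(normal, normal_pct)
    if denominator == 0:
        raise ZeroDivisionError("normal-sample percentile is zero")
    return percentile(tumor, tumor_pct) / denominator


# Made-up intensities, for demonstration only.
tumor_ais = [1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11]
normal_ais = [x / 2 for x in tumor_ais]
print(percentile_ratio(tumor_ais, normal_ais))
```

Using a high percentile rather than a mean makes the ratio sensitive to genes that are strongly expressed in only a subset of tumors, which appears to be the purpose of the definition above.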



Pkey ExAccn UnigeneID Unigene Title R1 R2 
421948 L42583 Hs.334309 keratin 6A 14.20 1.20 
439926 AW014875 Hs.137007 ESTs 11.31 21.34 
413324 V00571 Hs.75294 corticotropin releasing hormone 9.15 45.75 
421110 AJ250717 Hs.1355 cathepsin E 9.07 45.35 
417308 H60720 Hs.81692 KIAA0101 gene product 8.50 1.99 
431211 M86849 Hs.323733 gap junction protein, beta 2, 26kD (conn 8.39 1.28 
418406 X73501 Hs.84905 cytokeratin 20 8.10 40.50 
446619 AU076643 Hs.313 secreted phosphoprotein 1 (osteopontin, 7.98 1.38 
433001 AF217513 Hs.279905 clone HQ0310 PRO0310p1 7.67 2.12 
408243 Y00787 Hs.624 interleukin 8 7.56 4.85 
417715 AW969587 Hs.86366 ESTs 7.45 4.70 
417720 AA205625 Hs.208067 ESTs 7.34 9.18 
423673 BE003054 Hs.1695 matrix metalloproteinase 12 (macrophage 7.30 26.07 
418007 M13509 Hs.83169 matrix metalloproteinase 1 (interstitial 7.12 35.60 
413753 U17760 Hs.75517 laminin, beta 3 (nicein (125kD), kalinin 6.95 4.96 
441633 AW958544 Hs.112242 normal mucosa of esophagus specific 1 6.42 0.89 
422168 AA586894 Hs.112408 S100 calcium-binding protein A7 (psorias 6.08 3.49 
407242 M18728 gb:Human nonspecific crossreacting antig 5.96 0.96 
405033 C1002652*:gi|544327|sp|Q04799|FMO5_RABIT 5.84 16.22 
449230 BE613348 Hs.211579 melanoma cell adhesion molecule 5.82 2.28 
406685 M18728 5.80 0.89 
420159 AI572490 Hs.99785 Homo sapiens cDNA: FLJ21245 fis, clone C 5.77 28.85 
415511 AI732617 Hs.182362 ESTs 5.65 28.25 
426028 NM_001110 Hs.172028 a disintegrin and metalloproteinase doma 5.60 6.51 
424008 R02740 Hs.137555 putative chemokine receptor GTP-binding 5.59 2.33 
428651 AF196478 Hs.188401 annexin A10 5.55 27.75 
400843 NM_003105*:Homo sapiens sortilin-related 5.51 4.92 
402230 Target Exon 5.36 21.44 
452747 BE153855 Hs.61460 Ig superfamily receptor LNIR 5.33 2.80 
416065 BE267931 Hs.78996 proliferating cell nuclear antigen 5.17 1.98 
428450 NM_014791 Hs.184339 KIAA0175 gene product 4.90 2.63 
418322 AA284166 Hs.84113 cyclin-dependent kinase inhibitor 3 (CDK 4.77 2.35 
412610 X90908 Hs.74126 fatty acid binding protein 6, ileal (gas 4.77 3.71 
418663 AK001100 Hs.41690 desmocollin 3 4.74 1.48 
414683 S78296 Hs.76888 hypothetical protein MGC12702 4.74 2.92 
442432 BE093589 Hs.38178 hypothetical protein FLJ23468 4.68 1.61 
424834 AK001432 Hs.153408 Homo sapiens cDNA FLJ10570 fis, clone NT 4.65 11.63 
431958 X63629 Hs.2877 cadherin 3, type 1, P-cadherin (placenta 4.63 2.06 
423725 AJ403108 Hs.132127 hypothetical protein LOC57822 4.55 3.35 
401780 NM_005557*:Homo sapiens keratin 16 (foca 4.49 1.62 
424308 AW975531 Hs.154443 minichromosome maintenance deficient (S. 4.43 2.39 
401093 C12000586*:gi|6330167|dbj|BAA86477.1| (A 4.40 12.94 
417933 X02308 Hs.82962 thymidylate synthetase 4.35 2.29 
418113 AI272141 Hs.83484 SRY (sex determining region Y)-box 4 4.32 2.82 
412140 AA219691 Hs.73625 RAB6 interacting, kinesin-like (rabkines 4.24 8.15 
401781 Target Exon 4.15 1.31 
425234 AW152225 Hs.165909 ESTs, Weakly similar to I38022 hypotheti 4.14 7.39 
432842 AW674093 Hs.334822 hypothetical protein MGC4485 4.12 2.51 
408380 AF123050 Hs.44532 diubiquitin 4.11 3.26 
449722 BE280074 Hs.23960 cyclin B1 4.09 3.72 
420344 BE463721 Hs.97101 putative G protein-coupled receptor 4.07 2.50 
422809 AK001379 Hs.121028 hypothetical protein FLJ10549 4.00 7.14 
404977 Insulin-like growth factor 2 (somatomedi 3.89 5.17 
400409 AF153341 Homo sapiens winged helix/forkhead trans 3.88 7.29 
429113 D28235 Hs.196384 prostaglandin-endoperoxide synthase 2 (p 3.87 1.61 
444371 BE540274 Hs.239 forkhead box M1 3.87 2.75 
443171 BE281128 Hs.9030 TONDU 3.83 9.48 
441362 BE614410 Hs.23044 RAD51 (S. cerevisiae) homolog (E coli Re 3.82 2.98 
439963 AW247529 Hs.6793 platelet-activating factor acetylhydrola 3.77 3.83 
407137 T97307 gb:ye53h05.s1 Soares fetal liver spleen 3.73 4.91 
426088 AF038007 Hs.166196 ATPase, Class I, type 8B, member 1 3.68 7.08 
443426 AF098158 Hs.9329 chromosome 20 open reading frame 1 3.68 1.29 
414761 AU077228 Hs.77256 enhancer of zeste (Drosophila) homolog 2 3.57 3.06 
413063 AL035737 Hs.75184 chitinase 3-like 1 (cartilage glycoprote 3.67 1.18 
421508 NM_004833 Hs.105115 absent in melanoma 2 3.67 3.65 
443162 T49951 Hs.9029 DKFZP434G032 protein 3.66 3.21 



418478 


U38945 


Hs.1174 


cyclln-dependent kinase inhibitor 2A (me 


3.66 


3.57 


417771 


AA804698 


Hs.82547 


retinoic acid receptor resportder (tazaro 


3.62 


2.73 


441495 


AW294603 


Hs,127039 


ESTs 


3.60 


2.71 


422282 


AF019225 


Hs, 114309 


apolipoprotein L 


3.57 


3.92 


417079 


U65590 


Hs.81134 


interleukin 1 receptor antagonist 


3.55 


0.80 


417275 


X63578 


Hs.295449 


paiv albumin 


3.54 


4.60 


440006 


AK000517 


Hs.6844 


hypothetical protein FU20510 


3.52 


£59 


418203 


X54942 


Hs.83758 


CDC28 protein kinase 2 


3.50 


3.61 


400289 


X07820 


Hs.2258 


matrix metailoproteinase 10 (stromelysin 


3.50 


17.50 


404875 






NM_022819*:Homo sapiens phospholipase A2 


3.4S 


3.24 


420005 


AW271106 


Hs.133294 


ESTs 


3.40 


2.22 


409757 


NM 001898 


Hs.123114 


cystatin SN 


3.39 


2.93 


427719 


A1393122 


Hs.1 34726 


ESTs 


3.31 


2.51 


406690 


M29540 


Hs.220529 


carcirtoembryonic ant/gen-related cell ad 


3.2B 


0.42 


422283 


AW411307 


Hs. 114311 


COC45 (cell division cycle 45, S.cerevis 


3.2B 


2.62 


406081 






Target Exon 


3.25 


13.54 


426514 


BE616633 


Hs.170195 


bone morphogenetic protein 7 (osteogenic 


3.25 


2.46 


431009 


BE149762 


Hs.48956 


gap junction protein, beta 6 (connexin 3 


3.23 


2.88 


429983 


W92620 


Hs.260855 


ESTs 


3.20 


2.84 


422158 


L10343 


Hs, 112341 


protease inhibitor 3, skin-derived (SKAL 


3.18 


1.54 


426451 


A1908165 


Hs.169946 


GATA-binding protein 3 (T-cell receptor 


3.17 


5.44 


411945 


AL033527 


Hs.92137 


v-myc avian myelocytomatosis viral oncog 


3.13 . 


15.65 


415752 


BE314524 


Hs.78776 


putative transmembrane protein 


3.11 


2.46 


408633 


AW963372 


Hs.46677 


PRO2000 protein 


3.11 


3.30 


409956 


AW103364 


Hs,727 


inhibin, beta A (activin A, activin AB a 


3.09 


1.52 


412420 


ALC35668 


Hs.73853 


bone morphogenetic protein 2 


3.08 


2.22 


400297 


AJ127076 


Hs.306201 


hypothetical protein DKFZp56401278 


3.05 


12.49 


437931 


AI249468 


Hs.124434 


ESTs 


3.01 


3.70 


421451 


AA291377 


Hs.50831 


ESTs 


2.99 


14.95 


426682 


AV660038 


Hs.2056 


UDP glycosyltransferase 1 family, polype 


2.97 


Z10 


402239 






Targel Exon 


2.97 


3.37 


429345 


R11141 


Hs. 199695 


hypothetical protein 


2.96 


2.61 


435904 


AF261655 


Hs.8910 


1 ,2-a)pha-mannosidase 1C 


Z93 


2.13 


423961 


D13666 


Hs.136348 


periostin (OSF-2os) 


2.93 


1.44 


420923 


AF097021 


Hs.273321 


differentially expressed in hernatopoieti 


2.92 


0.95 


436608 


AA628980 




down syndrome critical region protein OS 


2.92 


4.86 


447343 


AA256641 


Hs.236894 


ESTs, Highly sircar to S02392 alpha-2-m 


2.88 


2.93 


439223 


AW23B299 


Hs.250618 


UL16 binding protein 2 


2.88 


2.15 


401747 






Homo sapiens keratin 17 (KRT17) 


2.88 


3.44 


410102 


AW24B508 


Hs.279727 


ESTs; homologue of PEM-3 {Clona savignyi 


2.86 


14.30 


444444 


AI149332


Hs.14855 


ESTs 


2.85 


2.68 


421100 


AW351839 


Hs.124660


Homo sapiens cDNA: FLJ21763 fis, clone C


2.84 


2.46 


431070 


AW408164 


Hs.249184 


transcription factor 19 (SC1)


2.78 


2.26 


417389 


BE260964 


Hs.82045 


midkine (neurite growth-promoting factor 


2.77 


2.34 


442994 


AI026718


Hs.16954 


ESTs 


2.75 


2.82 


444381 


BE387335 


Hs.283713 


ESTs, Weakly similar to S64054 hypotheti


2.74 


2.44 


434487 


AF143867 


Hs.337588 


ESTs, Moderately similar to S65657 alpha 


2.72 


3.37 


417003 


AL038170 


Hs.80756


betaine-homocysteine methyltransferase 


2.69 


2.70 


404440 






NM_021048:Homo sapiens melanoma antigen,


2.69 


13.45 


400844 






NM_003105*:Homo sapiens sortilin-related


2.69 


13.45 


426322 


J05068 


Hs.2012 


transcobalamin 1 (vitamin B12 binding pr


2.69 


1.36 


431448 


AL137517 


Hs.306201 


hypothetical protein DKFZp56401278 


2.69 


12.08 


403381 






ENSP00000231844*:Ecotropic virus integra


2.68 


13.40 


411248 


AA551538 


Hs.334605 


Homo sapiens cDNA FLJ14408 fis, clone HE


2.68 


2.43 


425206 


NM_002153


Hs.155109 


hydroxysteroid (17-beta) dehydrogenase 2 


2.67 


2.68 


435099 


AC004770 


Hs.4756 


flap structure-specific endonuclease 1


2.67 


1.79 


409361 


NM_005982


Hs.54416 


sine oculis homeobox (Drosophila) homolo


2.65 
10.36


Target Exon 


1.37 


1.16 


matrix metalloproteinase 11 (stromelysin


1.36 


1.39 


prostaglandin E synthase 


1.33 


1.49 


ATP-binding cassette, sub-family C (CFTR


1.31 


1.44 


KIAA0590 gene product


1.31 


1.22 


heticase-moi 


1.30 


0.64 


hypothetical protein FLJ12442


1.30 


1.32 


RecQ protein-like 4 


1.29 


1.42 


secretory leukocyte protease inhibitor ( 


1.29 


0.61 


ESTs 


1.27 


6.35 


ESTs, Moderately similar to S29539 ribos


1.26 


6.30 


uroplakin 3


1.26 


2.26


gb:H.sapiens protein-serine/threonine ki


1.26 


1.11 


DESC1 protein 


1.25 


1.01 


cancer/testis antigen (NY-ESO-1) 


1.23 


1.17


POU domain, class 5, transcription facto




J. 09 


5T4 oncofetal trophoblast glycoprotein 


1.20 


1.38 


Homo sapiens cDNA: FLJ22495 fis, clone H


1.19 


1 .LO 


C5001632*:gi|10645308|gb|AAG21430.1|AC00


1.19 


1 Oft 

1.30 


uroplakin 1A 


1.17 


2.30 


Homo sapiens cDNA FLJ12981 fis, clone NT


1.17 


5.85 


endothelial differentiation, lysophospha


1.17 


1.39 


follistatin


1.17 


1.17 


Homo sapiens clone 24416 mRNA sequence 


1.13 


2.83 


epiregulin 


1.13 


5.65 


Predicted cation efflux pump 


1.11 


5.55 


wingless-type MMTV integration site fami 


1.08 


5.40 


ESTs 


106 


5.30 


anti-Mullerian hormone


1.08 


1.55 


ESTs 


i!os 


1.02 


ESTs 


1.04 


7.20 


ESTs 


1.03 


0.65 


G protein-coupled receptor 39 


1.03 


0.92 


matrix metalloproteinase 13 (collagenase 


1.02 


5.10 


Homo sapiens mRNA; cDNA OKFZp566GC321 (f 


1.02 


0.60


heparin-binding growth factor binding pr 


1.00 


0.05 


serine (or cysteine) proteinase inhibito 


1.00 


0.25 


SOX7 SRY (sex determining region Y)-box


1.00 


0.25 


ESTs


1.00 


0.27 


interleukin 1 receptor, type II 


1.00 


0.45 


desmoglein 3 (pemphigus vulgaris antigen 


1.00 


0.72 


Target Exon 


1.00 


1.00 


gb:Human parathyroid hormone-related pep 


1.00 


1.00 


ESTs 


1.00 


1.00 


ESTs, Weakly similar to I54374 gene NF2


1.00 


1.00 


parathyroid hormone-like hormone 


1.00 


1.00 


transforming growth factor, beta 2 


1.00 


1.00 


amphiphysin (Stiff-Mann syndrome with br 


1.00 


1.00 


progesterone receptor 


1.00 


1.00 


tumor necrosis factor, alpha-induced pro 


1.00 


1.00 


cadherin 6, type 2, K-cadherin (fetal ki


1.00 


1.00 


insulinoma-associated 1


1.00 


1.25


tumor necrosis factor receptor superfami 


1.00 


1.35 


ESTs, Weakly similar to GGC1_HUMAN G ANT


1.00 


1.75 


gb:yc06a03.s1 Stratagene lung (937210) H 


1.00 


1.80 


ESTs 


1.00 


2.20 
410044


BE566742 


Hs.58169 


highly expressed in cancer, rich in leuc


1.00 


2.25 


425048 


H05468 


Hs.164502


ESTs 


1.00 


2.25 


422956 


BE545072 


Hs.122579


ECT2 protein (Epithelial cell transformi


1.00 


2.0U 


449448 


D60730 


Hs.57471 


ESTs 


1.00


2.70 


417791 


AW965339 


Hs.111471


ESTs 


1.00


2.95 


421373 


AA808229


Hs.167771


ESTs 


1.00


3.00 


427356 


AW023482 


Hs.97849


ESTs 


1.00 


3.15 


421070 


AA283185 


Hs.19327


ESTs 


1.00 


3.25 


415542 


R13474 


Hs.290263 


ESTs, Weakly similar to I38022 hypotheti


1.00 


3.35 


429486 


AF155827 


Hs.203963 


hypothetical protein FLJ10339


1.00 


3.55 


402075 






ENSP00000251056*:Plasma membrane calcium


1.00


3.95 


419559 


Y07828 


Hs.91096 


ring finger protein 


1.00 


4.00 


416661 


AA634543 


Hs.79440 


IGF-II mRNA-binding protein 3 


1.00 


4.00 


418738 


AW388633 


Hs.6682 


solute carrier family 7, (cationic amino 


1.00 


4.35 


412723 


AA648459 


Hs.335951 


hypothetical protein AF301222 


1.00 


4.40 


404877 






NM_005365:Homo sapiens melanoma antigen, 


1.00 


4.45 


443054 


AI745185 


Hs.8939 


yes-associated protein 65 kDa 


1.00 


4.45 


403047 






NM_005656*:Homo sapiens transmembrane pr


1.00 


4.50 


406434 






NM_030579*:Homo sapiens cytochrome b5 ou


1.00 


4.65 


412530 


AA766268 


Hs.266273 


hypothetical protein FLJ13346


1.00 


4.65 


433365 


AF026944 


Hs.293797 


ESTs 


1.00 


10.05 


427666 


AI791495 


Hs.180142 


calmodulin-like skin protein (CLSP) 


0.99 


0.60 


429504 


X99133 


Hs.204238 


lipocalin 2 (oncogene 24p3) 


0.99 


1.00 


431474 


AL133990 


Hs.190642


CEGP1 protein 


0.94 


9.14 


411880 


AW872477 




gb:hm30f03.x1 NCI_CGAP_Thy4 Homo sapiens


0.93 


21.15 


414221 


AW450979 




gb:UI-H-BI3-ala-a-12-0-UI.s1 NCI_CGAP_Su


0.91 


2.60 


444649 


AW207523 


Hs.197628


ESTs 


0.89 


11.15 


456034 


AW450979




gb:UI-H-BI3-ala-a-12-0-UI.s1 NCI_CGAP_Su


n no 


c 4*1 

0.10 


414521 


D28124 


Hs.76307 


neuroblastoma, suppression of tumorigeni 


0.84 


0.85 


439569 


AW602166 


Hs.222399 


CEGP1 protein 


0.84 


2.42 


432222 


AI204995




gb:an03c03.x1 Stratagene schizo brain S1 


0.81 


6.75 


407846 


AA426202 


Hs.40403 


Cbp/p300-interacting transactivator, wit


0.80 


0.57 


457292 


AI921270 


Hs.281462 


hypothetical protein FLJ14251


0.77 


1.40 


431089 


BE041395




ESTs, Weakly similar to unknown protein 


0.76 


14.88 


459702 


AI204995




gb:an03c03.x1 Stratagene schizo brain S1


0.74 


11.03 


424503 


NM_002205


Hs.149609 


integrin, alpha 5 (fibronectin receptor, 


0.56 


0.80 


406964 


M21305 




gb:Human alpha satellite and satellite 3 


0.53 


11.16 


416225 


AA577730 


Hs.188684


ESTs, Weakly similar to PC4259 ferritin


0.48 


3.01 


400288 


X06256 


Hs.149609 


integrin, alpha 5 (fibronectin receptor, 


0.40 


0.70 


421218 


NM_000499


Hs.72912 


cytochrome P450, subfamily 1 (aromatic c 


0.40 


0.61 



TABLE 3B

Pkey: Unique Eos probeset identifier number 
CAT number: Gene cluster number
Accession: Genbank accession numbers 



Pkey CAT Number Accession

411880 1263110_1 AW872477 BE088101 T05990
413804 1390710_1 T64682 BE168190 BE168256
414221 142696_1 AW450979 AA136653 AA136656 AW419381 AA984358 AA492073 BE168945 AA809054 AW238038 BE011212 BE011359 BE011367 BE011368 BE011362 BE011215 BE011365 BE011363
415989 156454_1 AI267700 AI720344 AA191424 AI023543 AI469633 AA172056 AW958465 AA172236 AW953397 AA355086
431089 327825_1 BE041395 AA49J826 AA621946 M715980 AA666102
431322 331543_1 AW970622 AA503009 AA502998 AA502989 AA502805 T92188
432222 343347_1 AI204995 AW827539 AW969908 AW440776 AA528756
436608 4236U AA628980 AI126603 BE504035
439780 47673_1 AL109688 R23665 R26578
444163 593658_1 AI126098 AI184746 AI148521
445182 632151_1 AW189787 AI215430 AW268499 AW205930 AJ392907 BE093017 BE093019 BE093010
451844 888230_1 T61430 AJ820546 AI821336
456034 142696_1 AW450979 AA136653 AA136656 AW419381 AA984358 AA492073 BE168945 AA809054 AW238038 BE011212 BE011359 BE011367 BE011368 BE011362 BE011215 BE011365 BE011363



TABLE 3C 

Pkey: Unique number corresponding to an Eos probeset 

Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (GI) numbers. "Dunham I. et al." refers to the publication entitled "The DNA sequence of human chromosome 22." Dunham I. et al., Nature (1999) 402:489-495.
Strand: Indicates DNA strand from which exons were predicted.
Nt_position: Indicates nucleotide positions of predicted exons.
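
As an illustration of how the nucleotide-position column can be read, the following Python sketch splits one entry into exon intervals and sums their lengths. The function names are hypothetical, and the sketch assumes that each entry is a comma-separated list of start-end pairs with inclusive coordinates; the table itself does not state the coordinate convention.

```python
def parse_exon_ranges(field):
    """Split a position entry such as '44116-44238,48208-48321' into (start, end) pairs."""
    ranges = []
    for part in field.split(","):
        start_text, end_text = part.strip().split("-")
        ranges.append((int(start_text), int(end_text)))
    return ranges


def total_exon_length(ranges):
    """Sum exon lengths, assuming inclusive coordinates (an assumption, not stated in the table)."""
    return sum(abs(end - start) + 1 for start, end in ranges)


if __name__ == "__main__":
    exons = parse_exon_ranges("44116-44238,48208-48321")
    print(exons, total_exon_length(exons))
```

Entries that were damaged in reproduction (for example, a missing hyphen) will raise a ValueError in this sketch, which is a convenient way to flag rows that need manual checking.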



Pkey Ref Strand Nt_position

400773 8131629 Minus 44116-44238,48208-48321
400843 9188605 Plus 5863»5970 l 7653.7m,8892.9023 l 9673.9807J0^10789,15254.154M l 23827-23^
400844 9188605 Plus 24746-24872,25035-25204
400845 9188605 Plus 34428-34612
400846 9188605 Plus 39310-39474
401093 8516137 Minus 22335-23166
401747 9789672 Minus 118596-118816,119119-119244,119809-119761,1204^^ 131932,132451-132575,133580-134011
401760 9929599 Plus 83126-83250,85320-85540,94719-95287
401780 7249190 Minus 28397-28617,28920-29045,29135-29296,29411-29567,29705-29787,30224-30573
401781 7249190 Minus 83215-83435,83531-83656,83740-83901,84237-84393,84955-85037,86290-86814
402075 8117407 Plus 121907-122035,122804-122921,124019-124161,124455-124610,125672-126076
402230 9966312 Minus 29782-29932
402239 7690131 Plus 38175-38304,42133-42266
402260 3399665 Minus 113765-113910,115653-115765,116808-116940
402305 7328724 Plus 40832-41362
402424 9796344 Minus 64925-65073
402777 9588235 Plus 126786-126948
402901 8894222 Minus 175426-175667
403047 3540153 Minus 59793-59988
403381 9438267 Minus 26009-26178
403903 7710671 Minus 101165-102597
404403 7272157 Minus 72053-72238
404440 7528051 Plus 80430-81581
404875 9801324 Plus 96588-96732,97722-97831
404877 1519284 Plus 1095-2107
404977 3738341 Minus 43081-43229
405033 7107731 Minus 142358-142546
405932 7767812 Minus 123525-123713
406081 9123861 Minus 38115-38691
406399 9256288 Minus 63448-63554
406434 9256651 Minus 17803-17931
406467 9795551 Plus 182212-182958
TABLE 4A: Preferred diagnostics for bladder cancer

Pkey: Unique Eos probeset identifier number

ExAccn: Exemplar Accession number, Genbank accession number

UnigeneID: Unigene number

Unigene Title: Unigene gene title

R1: 80th percentile of muscle-invasive bladder tumor (stage T2-T4) AIs divided by the 80th percentile of exophytic non-invasive carcinoma (stage Ta) AIs

R2: 90th percentile of bladder tumor AIs minus background divided by 90th percentile of normal body sample AIs minus background, where background equals the 15th percentile of all sample AIs

R3: 90th percentile of bladder tumor AIs divided by the 90th percentile of normal body sample AIs
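
The three ratios defined above can be sketched in Python as follows. This is a minimal illustration, not the procedure actually used to generate the table: the function names and argument names are hypothetical, and the percentile is computed by linear interpolation between ranked values, which is an assumption because the legend does not say which percentile convention was applied.

```python
def percentile(values, pct):
    """Return the pct-th percentile using linear interpolation (assumed convention)."""
    ordered = sorted(values)
    if not ordered:
        raise ValueError("percentile() needs at least one value")
    rank = (len(ordered) - 1) * pct / 100.0
    lower = int(rank)
    upper = min(lower + 1, len(ordered) - 1)
    fraction = rank - lower
    return ordered[lower] + (ordered[upper] - ordered[lower]) * fraction


def table_4a_ratios(invasive, noninvasive, tumor, normal, all_samples):
    """Compute R1, R2 and R3 for one probeset from lists of intensity values."""
    # Background is the 15th percentile of all sample intensities.
    background = percentile(all_samples, 15)
    # R1: 80th percentile of invasive (T2-T4) over 80th percentile of non-invasive (Ta).
    r1 = percentile(invasive, 80) / percentile(noninvasive, 80)
    # R2: background-subtracted 90th percentile of tumor over that of normal samples.
    r2 = (percentile(tumor, 90) - background) / (percentile(normal, 90) - background)
    # R3: the same ratio without background subtraction.
    r3 = percentile(tumor, 90) / percentile(normal, 90)
    return r1, r2, r3
```

Because R2 subtracts the same background from numerator and denominator, it diverges from R3 most strongly for probesets whose normal-tissue signal is close to background, which is consistent with rows in which R3 is far larger than R2.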



Pkey 


ExAccn 


UnigeneID


Unigene Title 


R1 


R2 


R3 


423961 


D13666 


Hs.136348 


periostin (OSF-2os)


11.22 


1.40 


1.44 


446619 


AU076643 


Hs.313 


secreted phosphoprotein 1 (osteopontin, 


8.40 


1.31 


1.38 


444381 


BE387335 


Hs.283713 


ESTs, Weakly similar to S64054 hypotheti


7.88 


2.06 


2.44


408243 


Y00787 


Hs.624 


interleukin 6


7.54 


2.86


4.85 


413063 


AL035737 


Hs.75184 


chitinase 3-like 1 (cartilage glycoprote


6.14 


1.18 


1.18 


414020 


NM_002984


Hs.75703 


small inducible cytokine A4 (homologous 


5.32 


1.34 


1.36 


424247 


X14008 


Hs.234734 


lysozyme (renal amyloidosis) 


5.27 


0.61 


0.57 


418007 


M13509 


Hs.83169


matrix metalloproteinase 1 (interstitial 


5.17 


7.47 


35.60 


422158 


L10343


Hs.112341


protease inhibitor 3, skin-derived (SKAL 


4.30 


1.75 


1.54 


446921 


AB012113 


Hs.16530


small inducible cytokine subfamily A (Cy 


4.29 


1.35 


1.60 


413324 


V00571 


Hs.75294 


corticotropin releasing hormone 


4.20 


6.27 


45.75 


412429 


AV650262 


Hs.75765 


GRO2 oncogene


4.00 


0.94 


0.93 


406636 


L12064




gb:Homo sapiens (clone WR4.12VL) anli-lh 


3.66 


1.25 


1.25 


428330 


L22524 


Hs.2256 


matrix metalloproteinase 7 (matrilysin, 


3.58 


2.07 


2.26


406687 


M31126 




matrix metalloproteinase 11 (stromelysin


3.41 


4.37 


3.37 


422550 


BE297626 


Hs.296049 


microfibrillar-associated protein 4


3.09 


0.39 


0.40 


410867 


X63556 


Hs.750 


fibrillin 1 (Marfan syndrome) 


2.96 


0.44 


0.45 


416658 


U03272 


Hs.79432 


fibrillin 2 (congenital contractural ara


2.90 


1.94 


3.46 


414812 


X72755 


Hs.77367 


monokine induced by gamma interferon 


2.80 


1.67 


3.10


423673 


BE003054 


Hs.1695 


matrix metalloproteinase 12 (macrophage 


2.77 


5.62 


26.07 


421379 


Y15221 


Hs.103982 


small inducible cytokine subfamily B (Cy 


2.71 


2.26


2.91


429500 


X78565


Hs.289114 


hexabrachion (tenascin C, cytotactin)


2.40 


0.47 


0.37 


417849 


AW291587


Hs.82733 


nidogen 2 


2.34


0.88 


0.86 


400419 


AF084545 




Target 


2.33 


1.54 


2.12


407811 


AW190902 


Hs.40098 


cysteine knot superfamily 1, BMP antagon 


2.10 


1.01 


1.01 


400289 


X07820


Hs.2258 


matrix metalloproteinase 10 (stromelysin 


1.99 


3.19 


17.50 


414774 


X02419 


Hs.77274 


plasminogen activator, urokinase 


1.94 


2.49 


1.84 


409420 


Z15008 


Hs.54451 


laminin, gamma 2 (nicein (100kD), kalini


1.94 


2.02 


5.39 


428227 


AA321649 


Hs.2248 


small inducible cytokine subfamily B (Cy 


1.84 


1.99 


8.55 


414476 


AA301867 


Hs.76224 


EGF-containing fibulin-like extracellula 


1.76 


0.33 


0.31 


431639 


AK000680 


Hs.266175 


phosphoprotein associated with GEMs 


1.76 


1.13 


1.16 


421958 


AA357185 


Hs.109918 


ras homolog gene family, member H 


1.75 


1.33 


1.27 


425247 


NM_005940 


Hs.155324 


matrix metalloproteinase 11 (stromelysin


1.58 


2.42 


1.39 


444342 


NM_014398


Hs.10887 


similar to lysosome-associated membrane 


1.57 


2.02 


8.55 


421493 


BE300341 


Hs.104925 


ectodermal-neural cortex (with BTB-like


1.55 


1.57 


1.55 


407939 


W05608 


Hs.312679 


ESTs, Weakly similar to A49019 dynein he


1.41 


0.79 


0.67 


429344 


R94038 


Hs.199538 


inhibin, beta C 


1.36 


1.39 


1.34 


402727 






NM_025065:Homo sapiens hypothetical prot 


1.34 


1.34 


1.56 


412420 


AL035668 


Hs.73853 


bone morphogenetic protein 2 


1.31 


1.63 


2.22


423217 


NM_000094


Hs.1640 


collagen, type VII, alpha 1 (epidermolys


1.27 


2.00 


1.67 


424206 


NM_003734


Hs.198241


amine oxidase, copper containing 3 (vase 


i *}ti 

%.£0 


O 1R 


u.ou 


415138 


C18356 


Hs.295944 


tissue factor pathway inhibitor 2 


1.23 


1.70 


4.34 


424479 


AF064238 


Hs.149098 


smoothelin


1.19 


0.27 


0.47 


445537 


AJ245671 


Hs.12844 


EGF-like-domain, multiple 6 


1.17 


1.59 


2.93


417079 


U65590 


Hs.81134 


interleukin 1 receptor antagonist 


1.16 


0.82 


0.80 


421634 


AA437414 


Hs.106283 


hypothetical protein FLJ10262


1.16 


1.05 


1.05 


439569 


AW602166 


Hs.222399 


CEGP1 protein 


1.15 


2.01 


2.42 


431346 


AA371059 


Hs.251636 


ubiquitin specific protease 3 


1.10 


1.64 


1.52 


448901 


AK001021 


Hs.22505 


hypothetical protein FLJ10159


1.10 


0.31 


0.31 


450983 


AA305384 


Hs.25740 


ERO1 (S. cerevisiae)-like


1.03 


1.51 


1.42 


422424 


AI186431 


Hs.296638 


prostate differentiation factor 


1.02 


2.77 


2.68 


458781 


AJ444821 


Hs.63085 


ESTs, Weakly similar to MPP3.HUMAN MAGUK 


1.00 


1.64 


5.45 


445413 


AA151342 


Hs.12677 


CGI-147 protein 


1.00 


1.51 


5.20 


432350 


NM_005865


Hs.274407 


protease, serine, 16 (thymus) 


1.00 


1.55 


4.30 


403106 






C8000064*:gi|10432393|emb|CAC10283.1| (A


1.00 


1.48 


4.24


402075 






ENSP00000251056*:Plasma membrane calcium 


1.00 


1.67 


3.95 


404860 






C1003394*:gi|12314272|emb|CAC00591.1| (A 


1.00 


1.40 


3.90 


434037 


AF116601 




WW domain-containing oxidoreductase 


1.00 


1.58 


3.70 


405738 






CX000390*:gi|6014646|gb|AAF01438.1|AF187


1.00 


1.36 


2.95


427585 


D31152 


Hs.179729 


collagen, type X, alpha 1 (Schmid metaph 


1.00 


1.45 


1.60 


439898 


AW505514 


Hs.209561 


KIAA1715 protein


1.00


1.28 


1.59 


452567 


D87120 


Hs.29882 


predicted osteoblast protein 


1.00 


1.10 


1.31 


401271 






C9000559*:gi|12314195|emb|CAB99338.1| (A


1.00 


2.12 


1.00 


411339 


BE164598 


Hs.274251 


hypothetical protein FLJ20375; KIAA1797


1.00 


2.05 


1.00 
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403005 






431145 


&OJ03U 




434939 


AP1R1A99 
r\r | D 1 1^ 
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431753 


Y7RAOQ 
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ns.*o4i 


419121 




Ue QQR9R 

nS.oygtO 


435505 


AP9TOMQ9 


Mc 91 191H 


452401 
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408397 






404483 






441206 


BE552314 


He 1^1R91 
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407853 


AA11A7Q7 


Uc 40400 
ns.*KRy3 


446119 




We 9000^1 


406471 






4021)0 






40791 1 


a pin/ 079 


He 41 Wi 
ns.*» 1300 


404829 






421925 


OOUO IU 


He 1ftQR9n 


406076 


ai *?qm 7Q 




45867? 




MS. 13/00 


416018 




He 7RQ77 


409357 


M7^R9fl 


lie CAMS. 
MS. 34410 




AWQ7R^1Q 




436178 


DC IDZjjO 


Ue- oi con 


402522 






HVWf 33 












W1 1 DC 






418693 


AI7KTia7D 
MJ/OUO/O 




*to t j /a 
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AIU010D7 


HS.l 1330 
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DczouyM 


Ue QinAtl 


403214 






414799 




He 77^9R 
nS.f (Oca 


fUQCuO 


II99QR1 


He 1R4411 


401519 






417501 


A) 041910. 


He R9999 


*tU300/l 


\A/74nm 
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He CC97Q 
MS. OO^i y 








444171 


MDU I Ot*iy 


He M4CA 




AWR79R97 


Ue CQ7R1 




AJQ0J4O/ 


L)_ 4 CO01 0 


*i£D/ ID 


HfVl_wDOfy 


Ue 171004 


43i34f 




nS.2oibo4 


413753 


U 17760 


He 7RR17 
ns. / 30 1 / 


426322 


J05068 


Hs.2012 


426514 


BE616633 


Hs.170195 


422282 


AF019225 


Hs.1 14309 


409757 


NM_001898 


Hs.123114 


427450 


AB014526 


Hs.178121 


414555 


N98569 


Hs.76422 


423774 


L39064 


Hs.1702 


404977 






428336 


AA503115 


Hs.183752 


451668 


Z43948 


Hs.326444 


428651 


AF196478 


Hs.188401 


421110 


AJ250717 


Hs.1355 



C210U0027 :gi)1817556fdbj[BAA13672.1| (D 


1.00 


1.89 


1.00 


Human DNA sequence from PAC 82J11 and co 


1.00 


1.89 


1.00 


Homo sapiens HSPC304 mRNA, partial cds


1.00 


1.86 


1.00 


neuromedin U 


1.00 


1.82 


1.00 


parathyroid hormone-like hormone


1.00 


1.69 


1.00 


interleukin-1 homolog 1


1.00 


1.67 


1.00 


tumor necrosis factor, alpha-induced pro 


1.00 


1.67 


1.00 


Ci 600 1 44r:gij 1 2053709|emb|CAC204 1 9. 1 1 { 


1.00 


1.66 


1.00 


NM_030958*:Homo sapiens organic anion tr 


1.00 


1.56 


1.00 


ESTs, Weakly similar to TERA_HUMAN [H.sa


1.00 


1.49 


1.00 


dickkopf (Xenopus laevis) homolog 1


1.00 


1.47 


1.00 


CCTe 


1.00 


1.47 


1.00 


Target Exon 


1.00 


1.44 


1.00 


r'mnfini 7Q*»n iQon77QinMv.ir > A^iccjin a\ /a 


1.00 


1.42 


1.00 


growth differentiation factor 8 


1.00 


1.40 


1.00 


Cl 00293 r:gi|7499208|plr||T20993 hypolhe 


1.00 


1.37 


1.00 


acidic epididymal glycoprotein-like 1


1.00 


1.26 


1.00 


Homo sapiens mRNA; cDNA DKFZp547P134 (fr 


1.00 


1.19 


1.00 


f-box and WD-40 domain protein 2 


1.00 


1.09 


1.00 


proprotein convertase subtilisin/kexin t


1.00 


1.04 


1.00 


casein, kappa 


1.00 


1.03 


1.00 


ATP-binding cassette, sub-family A (ABC1 


1.00 


0.84 


0.84 


hypothetical protein DKFZp564O0523 


1.00 


0.91 


0.80 


C1000568*:gi|12697965|dbj|BAB21801.1| (A


1.00 


0.60 


0.67 


ENSP00000252164*:KIAA1578 protein (Fragm 


1.00 


0.86 


0.56 


ENSP00000252232*:Sterol regulatory eleme 


1.00 


0.65 


0.52 


C6C00 93 1 *:gi| 955 8454|d bj JBAB0339 8. 1 1 (AB 


1.00 


0.58 


0.51 


thrombospondin 1 


1.00 


0.85 


0.51 


Homo sapiens BAC clone RP11-481J13 from 


1.00 


0.46 


0.38 


hypothetical protein FLJ20647


1.00 


0.64 


0.33 


thymopoietin 


1.00 


0.61 


0.29 


KIAA1555 protein 


1.00 


0.60 


0.28 


small inducible cytokine subfamily A (Cy 


1.00 


0.39 


0.19 


carboxypeptidase M 


1.00 


0.43 


0.13 


slit (Drosophila) homolog 3 


1.00 


0.30 


0.12 


butyrylcholinesterase 


1.00 


0.30 


0.08 


complement component 7 


1.00 


0.08 


0.06 


Homo sapiens, clone MGC:12401, mRNA, com


1.00 


0.10 


0.01 


CGI-27 protein


0.96 


1.32 


1.55


midkine (neurite growth-promoting factor


0.95 


3.15 


2.34


NM_016232*:Homo sapiens interleukin 1 re


0.94 


1.63 


2.51 


insulin-like growth factor binding prote 


0.92 


1.87 


1.60 


albumin 


0.92 


1.09 


1.03 


CI o00047o^:gi| 1 2737279!ref|XP J)1 2 1 63, 1 1 


0.88 


1.46 


3.44 


sema domain, immunoglobulin domain (Ig), 


0.87 


0.41 


0.50 


serine (or cysteine) proteinase inhibito


0.85 


1.38 


1.43 


U2001837 :gi|12697903|dbjIBAB21770.1| {A 


0.83 


1.46 


4.65 


small inducible cytokine subfamily A (Cy 


0.80 


0.91 


0.91 


ESTs, Weakly similar to DAP1_HUMAN DEATH


0.79 


0.58 


0.43 


wingless-type MMTV integration site fami


0.77 


1.47 


2.37 


sema domain, Immunoglobulin domain (Ig), 


0.75 


1.13 


1.18 


insulin-like growth factor 2 (somatomedi


0.68 


2.61 


1.87 


laminin, beta 3 (nicein (125kD), kalinin


0.68 


2.70 


4.96 


transcobalamin 1 (vitamin B12 binding pr


0.67 


1.50 


1.36 


bone morphogenetic protein 7 (osteogenic 


0.56 


2.05 


2.46 


apolipoprotein L 


0.55 


3.91 


3.92 


cystatin SN 


0.53 


2.72 


2.93 


KIAA0626 gene product 


0.52 


1.34 


1.97 


phospholipase A2, group IIA (platelets,


0.50 


1.04 


1.05 


interleukin 9 receptor 


0.49 


2.81 


6.46 


insulin-like growth factor 2 (somatomedi 


0.28 


5.78 


5.17 


microseminoprotein, beta- 


0.21 


1.47 


1.56 


cartilage acidic protein 1


0.18 


4.05 


3.60 


annexin A10 


0.17 


5.14 


27.75 


cathepsin E


0.12 


5.49 


45.35 



TABLE 4B

Pkey: Unique Eos probeset identifier number
CAT number: Gene cluster number
Accession: Genbank accession numbers
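
Accession lists in this table are sometimes printed without separating spaces. The following Python sketch shows one way to split such a field into individual accession numbers. The function name is hypothetical, and the pattern assumes only the two nucleotide accession formats that appear in these tables (one letter followed by five digits, or two letters followed by six digits); it is not a general GenBank parser and will not repair misprinted characters.

```python
import re

# Two letters + six digits, or one letter + five digits (assumed formats only).
ACCESSION_PATTERN = re.compile(r"[A-Z]{2}\d{6}|[A-Z]\d{5}")


def split_accessions(field):
    """Return the accession numbers found in a field, ignoring any whitespace."""
    compact = "".join(field.split())
    return ACCESSION_PATTERN.findall(compact)


if __name__ == "__main__":
    print(split_accessions("AF116601 AI110691 AF053566"))
```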



Pkey CAT Number Accession 

431146 32854_1 Z83850 AA459717 AW965384 AA333535
434037 37918_1 AF116601 AI110691 AF053566
TABLE 4C

Pkey: Unique number corresponding to an Eos probeset

Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (GI) numbers. "Dunham I. et al." refers to the publication entitled "The DNA sequence of human chromosome 22." Dunham I. et al., Nature (1999) 402:489-495.
Strand: Indicates DNA strand from which exons were predicted.
Nt_position: Indicates nucleotide positions of predicted exons.



Pkey Ref Strand Nt_position

401271 9797373 Minus 61292-61911
401519 6649315 Plus 157315-157950
401905 8671966 Plus 153965-154441,156599-156819
402075 8117407 Plus 121907-122035,122804-122921,124019-124161,124455-124610,125672-126076
402110 8131678 Minus 173889-174062
402522 9798493 Plus 20605-20731
402727 9211324 Plus 54596-54777
403005 5791501 Minus 16945-17053,20018-20403
403106 7331404 Plus 77162-77350,81338-81511
403214 7630945 Minus 76723-77027,79317-79484
404152 9884757 Plus 41111-41281,45495-45716,47801-47910
404488 8113286 Minus 64835-64994
404829 6624702 Minus 4913-5093,7310-7469,9472-9621,9951-10082
404860 8979555 Plus 65852-66081
404977 3738341 Minus 43081-43229
405494 8050952 Minus 70284-70518
405735 9931101 Minus 29854-29976
405738 9943998 Plus 44370-45410
406076 9123123 Plus 89972-90319
406397 9256243 Minus 127317-127454
406471 9795566 Plus 87383-87589
TABLE 5A: Genes upregulated in bladder cancer

Pkey: Unique Eos probeset identifier number

ExAccn: Exemplar Accession number, Genbank accession number

UnigeneID: Unigene number

Unigene Title: Unigene gene title

R1: 90th percentile of bladder tumor AIs divided by the 90th percentile of normal body sample AIs



Pkey 


ExAccn 


UnigeneID


Unigene Title 


R1 


459702 


AI204995




gb:an03c03.x1 Stratagene schizo brain S1


11.03 


437915 


AI637993 


Hs.202312 


Homo sapiens clone N11 NTera2D1 teratoca


10.40 


404917 






Target Exon 


9.65 


401056 


AI380797 




C11000517*:gi|7293105|gb|AAF48490.1| (AE


9.00 


447475 


Hs.158992


ESTs 


8.92


427335 


M448542 


Hs.251677 


G antigen 7B 


8.65 


450061 


AI797034


Hs.346238 


ESTs 


8.35 


401335 






Target Exon 


7.95 


424264 


080400 


Hs.239388 


Human DNA sequence from clone RP1-304B14 


7.35 


409041 


AB033025 


Hs.50081 


Hypothetical protein, XP_051860 (KIAA119


7.20 


436608 
451950 


AA628980




down syndrome critical region protein DS 


4.86 


AW292317 


Hs.213307 


ESTs 


4.45 


406542 






C19000728*:gi|12585552|sp|Q9Y2Q1|Z257_HU


3.73 


437931 


AI249468


Hs.124434


ESTs 


3.70 


443133 


AI033878 


Hs.41379 


ESTs 


3.60 


434487 


AF143867 


Hs.337588 


ESTs, Moderately similar to S65657 alpha 


3.37 


402239 






Target Exon 


3.37 


443162 


T49951 


Hs.9029 


DKFZP434G032 protein 


3.21 


403383 






Target Exon 


3.13 


438315 


R56795 


Hs.82419 


ESTs 


3.04 


452827 


AI571835 


Hs.55468 


ESTs 


3.01 


416225 


AA577730 


Hs.188684


ESTs, Weakly similar to PC4259 ferritin 


3.01 


402948 


W92620 




NM_025206:Homo sapiens hypothetical prot


2.91


429983 


Hs.260855 


ESTs 


2.84


429238 


NM_002849


Hs.198288 


protein tyrosine phosphatase, receptor t 


2.78


444371 


BE540274 


Hs.239 


forkhead box M1 


2.75


417003 


AL038170 


Hs.80756 


betaine-homocysteine methyltransferase


2.70


414906 


AA157911 


Hs.72200 


ESTs 


2.70 


425206 


NM_002153


Hs.155109 


hydroxysteroid (17-beta) dehydrogenase 2 


2.68


422283 


AW411307 


Hs.114311


CDC45 (cell division cycle 45, S.cerevis 


2.62


429345 


R11141 


Hs.199695


hypothetical protein 


2.61


414221 


AW450979 




gb:UI-H-BI3-ala-a-12-0-UI.s1 NCI_CGAP_Su


2.60


402305 


* 




C19000735*:gi|4508027|ref|NP_003414.1| z


2.54


432842 


AW674093 


Hs.334822 


hypothetical protein MGC4485 


2.51


427719 


AI393122 


Hs.134726 


ESTs 


2.51


455797 


BE091833 




gb:IL2-BT0731-260400476-F04 BT0731 Homo 


2.50


414807 


AI738616 


Hs.77348 


hydroxyprostaglandin dehydrogenase 15-(N 


2.49 


456967 


AW004056 


Hs.168357


T-box 2


2.49


406387 






Target Exon 


2.48


417997 


AA418189 


Hs.23017 


Homo sapiens cDNA FLJ22747 fis, clone K


2.48 


415752 


BE314524 


Hs.78776 


putative transmembrane protein 


2.46


411248 


AA551538 


Hs.334605 


Homo sapiens cDNA FLJ14408 fis, clone HE


2.43


434293 


NM_004445


Hs.3796 


EphB6 


2.42


433078 


AW015188 


Hs.121575 


Homo sapiens cDNA FLJ12231 fis, clone MA


2.40


425997 


AK000086 


Hs.165948


hypothetical protein FLJ20079


2.38


418322 


AA284166 


Hs.84113 


cyclin-dependent kinase inhibitor 3 (CDK


2.35


452012 


AA307703 


Hs.279766 


kinesin family member 4A 


2.34 


445600 


AF034803 


Hs.12953


PTPRF interacting protein, binding prote


2.33


418941 


M452970 


Hs.239527 


E1B-55kDa-associated protein 5 


2.33


454609 


AW810204 




gb:MR4.ST0125-021199417-d08 ST0125 Homo 


2.28


444476 


AF020038 


Hs.11223


isocitrate dehydrogenase 1 (NADP), solub


2.26


420005 


AW271106 


Hs.133294 


ESTs 


2.22


439826 


NM_014965


Hs.6705 


KIAA1042 protein 


2.22


405531 






Target Exon 


2.21


436569 


BE439539 


Hs.279837


glutathione S-transferase M2 (muscle) 


2.18


404394 






ENSP00000241075:TRRAP PROTEIN. 


2.17


427479 


BE410092 


Hs.178471


KIAA0798 gene product


2.17


435904 
431620 


AF261655 


Hs.8910 


1,2-alpha-mannosidase IC


2.13


AA126109 


Hs.264981 


2'-5'-oligoadenylate synthetase 2 (69-71


2.12


426682 


AV660038 


Hs.2056 


UDP glycosyltransferase 1 family, polype


2.10


422765 


AW409701 


Hs.1578


baculoviral IAP repeat-containing 5 (sur


2.10


451385 


AA017656 




gb:ze39h01.r1 Soares retina N2b4HR Homo


2.09


403477 






C3002160*:gi|7662420|ref|NP_055738.1| KI


2.09


417151 


AA194055 


Hs.293858 


ESTs 


2.08


448262 


AW880830 


Hs.186273 


ESTs 


2.07


415192 


D17793 


Hs.78183 


aldo-keto reductase family 1, member C3 


2.04


402994 






NM_002463*:Homo sapiens myxovirus (influ


2.04



426053 


U66105 


Hs.172182


poly(A)-binding protein, cytoplasmic 1


2.02 


423271 


W47225 


Hs.126256


interleukin 1, beta


2.01 


419741 


NM_007019


Hs.93002 


ubiquitin carrier protein E2-C


1.99 


407581 


R48402 


Hs.173508


P3ECSL 


1.95 


410197 


NM_005518


Hs.59889 


3-hydroxy-3-methylglutaryl-Coenzyme A sy


1.95 


427122 


AW057736 


Hs.323910 


HER2 receptor tyrosine kinase (c-erb-b2, 


1.93 


436481 


AA379597 


Hs.5199 


HSPC150 protein similar to ubiquitin-con


1.93 


436251 


BE515065 


Hs.296585 


nucleolar protein (KKE/D repeat)


1.89 


401961 






NM_021626:Homo sapiens serine carboxypep 


1.86 


434042 


AI589941 


Hs.8254 


Homo sapiens, Similar to tumor different 


1.85


447532 


AK000614 


Hs.18791 


hypothetical protein FLJ20607


1.84 


418526 


BE019020 


Hs.85838 


solute carrier family 16 (monocarboxylic


1.83 


429612 


AF062649 


Hs.252587 


pituitary tumor-transforming 1 


1.80 


422164 


NM_014312


Hs.112377


cortical thymocyte receptor (X. laevis


1.80 


422247 


U18244 


Hs.113602


solute carrier family 1 (high affinity a 


1.79 


410407 


X66639 


Hs.63287 


carbonic anhydrase IX


1.78 


412115 


AK001763 


Hs.73239 


hypothetical protein FLJ10901


1.77 


414809 


AI434699


Hs.77356 


transferrin receptor (p90, CD71)


1.75 


432210 


AI567421 


Hs.273330 


Homo sapiens, clone IMAGE:3544662, mRNA,


1.74 


427239 


BE270447


Hs.174070


ubiquitin carrier protein 


1.74 


459198 


AI086347 


Hs.151138 


ESTs 


1.74 


421066 


AU076725 


Hs.101408 


branched chain aminotransferase 2, mitoc 


1.71 


424687 


J05070 


Hs.151738 


matrix metalloproteinase 9 (gelatinase B 


1.70 


450663 


H43540 


Hs.25292 


ribonuclease HI, large subunit 


1.70 


417324 


AW265494 




ESTs 


1.67 


453883 


AI638516 


Hs.347524 


cofactor required for Sp1 transcriptiona 


1.66 


428000 


R35145 


Hs.291904 


accessory proteins BAP31/BAP29 


1.65 


450635 


AW403954 


Hs.25237 


mesenchymal stem cell protein DSCD75 


1.63 


423397 


NM_001838 


Hs.1652 


chemokine (C-C motif) receptor 7 


1.62 


415440 


D83782 


Hs.78442 


SREBP CLEAVAGE-ACTIVATING PROTEIN 


1.62 


428028 


U52112 


Hs.182018 


interleukin-1 receptor-associated kinase 


1.62 


426783 


Z19084 


Hs.172210 


MUF1 protein 


1.62 


445937 


AI452943 


Hs.321231 


UDP-Gal:betaGlcNAc beta 1,4-galactosylt 


1.61 


445462 


AA378776 


Hs.288649 


hypothetical protein MGC3077 


1.60 


400965 






C11002190*:gi|12737279|ref|XP_012163.1| 


1.59 


432269 


NM_002447 


Hs.2942 


macrophage stimulating 1 receptor (c-met 


1.59 


429578 


AI969028 


Hs.99389 


ESTs 


1.59 


449027 


AJ271216 


Hs.22880 


dipeptidylpeptidase III 


1.59 


431840 


AA534908 


Hs.2860 


POU domain, class 5, transcription facto 


1.59 


417900 


BE250127 


Hs.82906 


CDC20 (cell division cycle 20, S. cerevi 


1.59 


429002 


AW248439 


Hs.2340 


Junction plakoglobin 


1.57 


442410 


AW996503 


Hs.197680 


ESTs 


1.56 


407601 


AC002300 


Hs.37129 


sodium channel, nonvoltage-gated 1, beta 


1.55 


418543 


NM_005329 


Hs.85962 


hyaluronan synthase 3 


1.54 


424611 


NM_001421 


Hs.151139 


E74-like factor 4 (ets domain transcript 


1.54 


414732 


AW410976 


Hs.77152 


minichromosome maintenance deficient (S. 


1.54 


408930 


AA146721 


Hs.334686 


hypothetical protein FLJ21588 


1.53 


448993 


AI471630 


Hs.8127 


KIAA0144 gene product 


1.52 


414053 


BE391635 


Hs.75725 


transgelin 2 


1.51 


433662 


W07162 


Hs.150826 


CATX-8 protein 


1.50 


432562 


BE531048 


Hs.278422 


DKFZP586G1122 protein 


1.50 


402260 






NM_001436*:Homo sapiens fibrillarin (FBL 


1.48 


426127 


L36983 


Hs.167013 


dynamin 2 


1.48 


427557 


NM_002659 


Hs.179657 


plasminogen activator, urokinase recepto 


1.48 


418026 


BE379727 


Hs.83213 


fatty acid binding protein 4, adipocyte 


1.47 


418960 


NM_004494 


Hs.89525 


hepatoma-derived growth factor (high-mob 


1.46 


428293 


BE250944 


Hs.183556 


solute carrier family 1 (neutral amino a 


1.46 


432344 


AJ476474 


Hs.248156 


ESTs 


1.46 


453449 


W16752 


Hs.32981 


sema domain, immunoglobulin domain (Ig), 


1.46 


450690 


AA296696 


Hs.333418 


FXYD domain-containing ion transport reg 


1.46 


441940 


AW298115 


Hs.128152 


ESTs 


1.45 


409893 


AW247090 


Hs.57101 


minichromosome maintenance deficient (S. 


1.44 


439318 


AW837046 


Hs.6527 


G protein-coupled receptor 56 


1.42 


422565 


BE259035 


Hs.118400 


singed (Drosophila)-like (sea urchin fas 


1.41 


428928 


BE409838 


HS.UWoo/ 


cadherin 1, type 1, E-cadherin (epitheli 


1.41 


445417 


AK001058 


Hs.12680 


Homo sapiens cDNA FLJ10196 fis, clone HE 


1.39 


441565 


AW953575 


Hs.303125 


p53-induced protein PIGPC1 


1.37 


439180 


AI393742 


Hs.199067 


v-erb-b2 avian erythroblastic leukemia v 


1.35 


418399 


AF131781 


Hs.84753 


hypothetical protein FLJ12442 


1.32 


432636 


AA340864 


Hs.278562 


claudin 7 


1.32 


439053 


BE244588 


Hs.6456 


chaperonin containing TCP1, subunit 2 (b 


1.32 


413762 


AW411479 


Hs.848 


FK506-binding protein 4 (59kD) 


1.31 


453914 


NM_000507 


Hs.574 


fructose-1,6-bisphosphatase 1 


1.28 


430056 


X97548 


Hs.228059 


KRAB-associated protein 1 


1.24 


451524 


AK001466 


Hs.26516 


hypothetical protein FLJ10604 


1.23 


431441 


U81961 


Hs.2794 


sodium channel, nonvoltage-gated 1 alpha 


1.20 


439863 


BE547830 


Hs.9408 


paired immunoglobulin-like receptor beta 


1.19 
451541 BE279383 Hs.26557 plakophilin 3 1.16 

406906 Z25424 gb:H.sapiens protein-serine/threonine W 1.11 

429504 X99133 Hs.204238 lipocalin 2 (oncogene 24p3) 1.00 

414002 NM_006732 Hs.75578 FBJ murine osteosarcoma viral oncogene h 0.86 

431369 BE184455 Hs.251754 secretory leukocyte protease inhibitor 0.61 



TABLE 5B 



Pkey: Unique Eos probeset identifier number 
CAT number: Gene cluster number 
Accession: Genbank accession numbers 



Pkey CAT Number Accession 

414221 142696.1 AW450979 AA136653 AA136656 AW419381 AA984358 AA492073 BE168945 AA809054 AW238038 BE011212 BE011359 BE011357 
BE011368 BE011362 BE011215 BE011365 BE011363 

417324 166714.1 AW265494 AA455904 AA195677 AW265432 AW991605 AA456370 

436508 42361_1 AA628980 AI126603 BE504035 

451385 86787.1 AA017656 AA017374 AA019761 

454609 1226517_1 AW810204 AW810555 AW810196 AW810619 AW810507 

455797 1356826_1 BE091833 BE091874 BE091871 

TABLE 5C 



Pkey: Unique number corresponding to an Eos probeset 
Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (GI) numbers. "Dunham I. et al." refers to the publication entitled "The DNA sequence of human chromosome 22." Dunham I. et al., Nature (1999) 402:489-495. 
Strand: Indicates DNA strand from which exons were predicted. 
Nt_position: Indicates nucleotide positions of predicted exons. 



Pkey 


Ref 


Strand 


Nt_position 


400965 


7770576 


Minus 


173043-173564 


401066 


8217436 


Plus 


71448-71574 


401335 


9384881 


Plus 


15736-16352 


401961 


4581193 


Minus 


124054-124209 


402239 


7690131 


Plus 


38175-38304,42133-42256 


402260 


3399665 


Minus 


113765-113910,115653-115765,116808-116940 


402305 


7328724 


Plus 


40832-41362 


402948 


9368458 


Minus 


143456-143626,143808-143935 


402994 


2996643 


Minus 


4727-4969 


403383 


9438267 


Minus 


119837-121197 


403477 


9958251 


Plus 


111834-112008 


404394 


3135305 


Minus 


37121-37205,37491-37762,41053-41140,41322-41593,41773-41919 


404917 


7341851 


Plus 


49330-49498 


405531 


9665194 


Plus 


35602-35803 


406387 


9256180 


Plus 


116229-116371,117512-117651 


406542 


7711499 


Plus 


117335-118473 
TABLE 6A: Genes upregulated in bladder cancer 

Pkey: Unique Eos probeset identifier number 

ExAccn: Exemplar Accession number, Genbank accession number 

UnigeneID: Unigene number 

Unigene Title: Unigene gene title 

R1: 90th percentile of bladder tumor AIs divided by the 90th percentile of normal urothelium biopsy AIs 

R2: 90th percentile of bladder tumor AIs divided by the 90th percentile of normal urothelium biopsy and normal bladder AIs 
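
The two ratio columns defined above can be illustrated with a short sketch. It assumes that the "AIs" are per-sample intensity values for one probeset and that percentiles are taken by linear interpolation over the sorted values; the source states neither, and the function names and sample numbers are hypothetical.

```python
from statistics import quantiles


def percentile(values, pct):
    """Return the pct-th percentile (1..99) of values.

    Uses inclusive linear interpolation; the interpolation method is an
    assumption, since the table legend does not specify one.
    """
    if not 1 <= pct <= 99:
        raise ValueError("pct must be between 1 and 99")
    if len(values) == 1:
        return float(values[0])
    cut_points = quantiles(values, n=100, method="inclusive")
    return cut_points[pct - 1]


def ratio_of_percentiles(numerator_values, denominator_values, pct=90):
    """Divide the pct-th percentile of one sample group by that of another."""
    return percentile(numerator_values, pct) / percentile(denominator_values, pct)


# Hypothetical intensities for a single probeset (not taken from the tables).
tumor_ais = [120.0, 340.0, 560.0, 610.0, 900.0]
normal_biopsy_ais = [40.0, 55.0, 60.0, 75.0, 80.0]
normal_bladder_ais = [50.0, 65.0, 90.0]

# R1: tumor vs. normal urothelium biopsies only.
r1 = ratio_of_percentiles(tumor_ais, normal_biopsy_ais)
# R2: tumor vs. normal urothelium biopsies pooled with normal bladder.
r2 = ratio_of_percentiles(tumor_ais, normal_biopsy_ais + normal_bladder_ais)

print(round(r1, 2), round(r2, 2))
```

Under these assumptions, a probeset whose R1 and R2 differ is one where adding the normal bladder samples shifts the 90th percentile of the normal group.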



Pkey 


ExAccn 


UnigeneID 


Unigene Title 


R1 


R2 


439926 


AW014875 


Hs.137007 


ESTs 


11.31 


11.31 


413324 


V00571 


Hs.75294 


corticotropin releasing hormone 


9.15 


9.15 


421110 


AJ250717 


Hs.1355 


cathepsin E 


9.07 


9.07 


417308 


H60720 


Hs.81892 


KIAA0101 gene product 


8.50 


8.50 


418406 


X73501 


Hs.84905 


cytokeratin 20 


8.10 


8.10 


446619 


AU076643 


Hs.313 


secreted phosphoprotein 1 (osteopontin, 


7.98 


7.98 


433001 


AF217513 


Hs.279905 


clone HQ0310 PRO0310p1 


7.67 


7.67 


408243 


Y00787 


Hs.624 


interleukin 8 


7.56 


7.56 


416065 


BE267931 


Hs.78996 


proliferating cell nuclear antigen 


7.17 


5.17 


425397 


J04088 


Hs.156346 


topoisomerase (DNA) II alpha (170kD) 


7.17 


8.24 


414183 


AW957446 


Hs.301711 


ESTs 


7.14 


4.62 


418007 


M13609 


Hs.83169 


matrix metalloproteinase 1 (interstitial 


7.12 


7.12 


426125 


X87241 


Hs.166994 


FAT tumor suppressor (Drosophila) homolo 


6.90 


3.38 


427337 


Z46223 


Hs.176663 


Fc fragment of IgG, low affinity IIIb, r 


6.85 


4.98 


441633 


AW958544 


Hs.112242 


normal mucosa of esophagus specific 1 


6.42 


6.42 


438091 


AW373062 




nuclear receptor subfamily 1, group I, m 


6.32 


6.32 


413063 


AL035737 


Hs.75184 


chitinase 3-like 1 (cartilage glycoprote 


ao9 


a 67 


414219 


W20010 


Hs.75823 


ALL1-fused gene from chromosome 1q 


5.93 


4.68 


405033 






C1002652*:gi|544327|sp|Q04799|FMO5_RABIT 


5.84 


5.84 


413132 


NM_006823 


Hs.75209 


protein kinase (cAMP-dependent, catalyti 


5.79 


5.68 


428336 


AA503115 


Hs.183752 


microseminoprotein, beta- 


5.78 


4.57 


449230 


BE613348 


Hs.211579 


melanoma cell adhesion molecule 


5.69 


5.82 


423673 


BE003054 


Hs.1695 


matrix metalloproteinase 12 (macrophage 


5.69 


7.30 


415511 


AI732617 


Hs.182362 


ESTs 


5.65 


5.65 


426028 


Nfoomo 


Hs.172028 


a disintegrin and metalloproteinase doma 


5.60 


5.60 


421948 


L42583 


Hs.334309 


keratin 6A 


5.59 


14.20 


428651 


AF196478 


Hs.188401 


annexin A10 


5.55 


5.55 


424008 


R02740 


Hs.137555 


putative chemokine receptor; GTP-binding 


5.38 


5.59 


406687 


M31126 




matrix metalloproteinase 11 (stromelysin 


5.36 


5.34 


439453 


BE264974 


Hs.6566 


thyroid hormone receptor interactor 13 


5.35 


5.35 


408246 


N55669 


Hs.333823 


mitochondrial ribosomal protein L13 


5.20 


3.50 


427678 


BE267756 


Hs.180312 


mitochondrial ribosomal protein S16 


5.13 


4.10 


426158 


NM_001982 


Hs.199067 


v-erb-b2 avian erythroblastic leukemia v 


4.97 


3.71 


442315 


AA173992 


Hs.7956 


ESTs, Moderately similar to ZN91_HUMAN Z 


4.90 


4.90 


418322 


AA284166 


Hs.84113 


cyclin-dependent kinase inhibitor 3 (CDK 


4.90 


4.77 


417720 


AA205625 


Hs.208067 


ESTs 


4.84 


7.34 


423979 


AF229181 


Hs.136644 


CS box-containing WD protein 


4.81 


4.61 


420981 


L40904 


Hs.100724 


peroxisome proliferative activated recep 


4.81 


4.43 


433470 


AW960564 




transmembrane 4 superfamily member 1 


4.72 


4.72 


429138 


AB020657 


Hs.197298 


NS1-binding protein 


4.71 


4.71 


408063 


BE086548 


Hs.42346 


calcineurin-binding protein calsarcin-1 


4.71 


4.71 


452714 


AW770994 


Hs.30340 


hypothetical protein KIAA1165 


4.69 


4.69 


442432 


BE093589 


Hs.38178 


hypothetical protein FLJ23468 


4.68 


4.68 


424834 


AK001432 


Hs.153408 


Homo sapiens cDNA FLJ10570 fis, clone NT 


4.65 


4.65 


446921 


AB012113 


Hs.16530 


small inducible cytokine subfamily A (Cy 


4.64 


4.64 


427490 


Z95152 


Hs.178695 


mitogen-activated protein kinase 13 


4.63 


5.12 


412490 


AW803564 


Hs.288850 


Homo sapiens cDNA: FLJ22528 fis, clone H 


4.61 


4.61 


418030 


BE207573 


Hs.83321 


neuromedin B 


4.60 


4.60 


401192 






Target Exon 


4.60 


4.29 


426761 


AI015709 


Hs.172089 


Homo sapiens mRNA; cDNA DKFZp586l2022 (f 


4.59 


3.51 


452747 


BE153855 


Hs.61460 


Ig superfamily receptor LNIR 


4.68 


5.33 


449618 


AI076459 


Hs.15978 


KIAA1272 protein 


4.58 


4.58 


423725 


AJ403108 


Hs.132127 


hypothetical protein LOC57822 


4.55 


4.55 


415701 


NM_003878 


Hs.78619 


gamma-glutamyl hydrolase (conjugase, fol 


4.52 


4.70 


446742 


AA232119 


Hs.16085 


putative G-protein coupled receptor 


4.49 


4.11 


419433 


AA814807 


Hs.7395 


hypothetical protein FLJ23182 


4.48 


4.48 


412326 


R07566 


Hs.73817 


small inducible cytokine A3 (homologous 


4.47 


4.47 


427528 


AU077143 


Hs.179565 


minichromosome maintenance deficient (S. 


4.45 


4.45 


444371 


BE540274 


Hs.239 


forkhead box M1 


4.44 


3.87 


444006 


BE395085 


Hs.10086 


type I transmembrane protein Fn14 


4.43 


3.63 


424308 


AW975531 


Hs.154443 


minichromosome maintenance deficient (S. 


4.43 


4.43 


401093 






C12000586*:gi|6330167|dbj|BAA86477.1| (A 


4.40 


4.40 


447644 


AW861622 


Hs.108646 


Homo sapiens cDNA FLJ14934 fis, clone PL 


4.39 


4.39 


417933 


X02308 


Hs.82962 


thymidylate synthetase 


4.38 


4.35 


409461 


AA382169 


Hs.54483 


N-myc (and STAT) interactor 


4.36 


3.68 


401451 






NM_004496*:Homo sapiens hepatocyte nucle 


4.35 


4.35 


450746 


D82673 


Hs.278589 


general transcription factor II, 1 


4.35 


3.36 



WO 03/003906 



414583 


S78296 


Hs.76888 


hypothetical protein MGC12702 


4.34 


4.74 


434203 


BE262677 


Hs.283558 


hypothetical protein PRO1855 


4.31 


4.31 


417615 


BE548641 


Hs.82314 


hypoxanthine phosphoribosyl transferase 1 


4.30 


4.30 


416815 


U41514 


Hs.80120 


UDP-N-acetyl-alpha-D-galactosamine:polyp 


4.30 


4.30 


440086 


NM_005402 


Hs.288757 


v-ral simian leukemia viral oncogene hom 


4.29 


4.29 


417715 


AW969587 


Hs.86366 


ESTs 


4.27 


7.45 


409757 


NM_001898 


Hs.123114 


cystatin SN 


4.24 


3.39 


412140 


AA219691 


Hs.73625 


RAB6 interacting, kinesin-like (rabkines 


4.24 


4.24 


432842 


AW674093 


Hs.334822 


hypothetical protein MGC4485 


4.21 


4.12 


446847 


T51454 


Hs.82845 


Homo sapiens cDNA: FLJ21930 fis, clone H 


4.20 


4.20 


436856 


AI469355 


Hs.127310 


ESTs 


4.19 


4.19 


428450 


NM_014791 


Hs.184339 


KIAA0175 gene product 


4.16 


4.90 


425234 


AW152225 


Hs.165909 


ESTs, Weakly similar to I38022 hypotheti 


4.14 


4.14 


409231 


AA446644 


Hs.692 


GA733-2 antigen; epithelial glycoprotein 


4.14 


5.99 


426283 


NM_003937 


Hs.169139 


kynureninase (L-kynurenine hydrolase) 


4.12 


4.12 


446849 


AU076617 


Hs.16251 


cleavage and polyadenylation specific fa 


4.12 


3.43 


400843 






NM_003105*:Homo sapiens sortilin-related 


4.11 


5.51 


449722 


BE280074 


Hs.23960 


cyclin B1 


4.09 


4.09 


405506 






Target Exon 


4.09 


3.75 


420344 


BE463721 


Hs.97101 


putative G protein-coupled receptor 


4.07 


4.07 


426997 


BE620738 


Hs.173125 


peptidylprolyl isomerase F (cyclophilin 


4.05 


4.80 


456525 


AW468397 


Hs.100000 


S100 calcium-binding protein A8 (calgran 


4.03 


7.64 


437150 


R51407 


Hs.77910 


3-hydroxy-3-methylglutaryl-Coenzyme A sy 


4.02 


4.02 


413794 


AF234532 


Hs.61638 


myosin X 


4.02 


4.02 


422511 


AU076442 


Hs.117938 


collagen, type XVII, alpha 1 


4.02 


4.72 


414020 


NM_002984 


Hs.75703 


small inducible cytokine A4 (homologous 


4.01 


3.99 


416391 


AI878927 


Hs.79284 


mesoderm specific transcript (mouse) hom 


4.01 


4.01 


422809 


AK001379 


Hs.121028 


hypothetical protein FLJ10549 


4.00 


4.00 


400277 






Eos Control 


4.00 


3.47 


415791 


H09366 


Hs.78853 


uracil-DNA glycosylase 


3.99 


3.37 


412610 


X90908 


Hs.74126 


fatty acid binding protein 6, ileal (gas 


3.98 


4.77 


427557 


NM_002659 


Hs.179657 


plasminogen activator, urokinase recepto 


3.98 


3.95 


413753 


U17760 


Hs.75517 


laminin, beta 3 (nicein (125kD), kalinin 


3.96 


6.95 


420859 


AW468397 


Hs.100000 


S100 calcium-binding protein A8 (calgran 


3.92 


5.04 


400409 


AF153341 




Homo sapiens winged helix/forkhead trans 


3.91 


3.88 


408988 


AL119844 


Hs.49476 


Homo sapiens clone TUA8 Cri-du-chat regi 


3.90 


3.90 


411678 


AI907114 


Hs.71465 


squalene epoxidase 


3.89 


3.89 


429113 


D28235 


Hs.196384 


prostaglandin-endoperoxide synthase 2 (p 


3.87 


3.87 


428428 


AL037544 


Hs.184298 


cyclin-dependent kinase 7 (homolog of Xe 


3.87 


3.87 


442932 


AA457211 


Hs.8858 


bromodomain adjacent to zinc finger doma 


3.85 


4.50 


429083 


Y09397 


Hs.227817 


BCL2-related protein A1 


3.85 


3.85 


439963 


AW247529 


Hs.6793 


platelet-activating factor acetylhydrola 


3.82 


3.77 


441362 


BE614410 


Hs.23044 


RAD51 (S. cerevisiae) homolog (E coli Re 


3.82 


3.82 


430589 


AJ002744 


Hs.246315 


UDP-N-acetyl-alpha-D-galactosamine:polyp 


3.81 


3.81 


417771 


AA804698 


Hs.82547 


retinoic acid receptor responder (tazaro 


3.81 


3.62 


430259 


BE550182 


Hs.127826 


RalGEF-like protein 3, mouse homolog 


3.80 


3.80 


447973 


AB011169 


Hs.20141 


similar to S. cerevisiae SSM4 


3.77 


3.77 


404875 






NM_022819*:Homo sapiens phospholipase A2 


3.77 


3.46 


411299 


BE409857 


Hs.69499 


hypothetical protein 


3.76 


3.76 


418827 


BE327311 


Hs.47166 


HT021 


3.76 


3.76 


446839 


BE091926 


Hs.16244 


mitotic spindle coiled-coil related prot 


3.75 


3.75 


407137 


T97307 




gb:ye53h05.s1 Soares fetal liver spleen 


3.73 


3.73 


433376 


AI249361 


Hs.74122 


caspase 4, apoptosis-related cysteine pr 


3.71 


3.71 


400294 


N95796 


Hs.278695 


Homo sapiens prostein mRNA, complete cds 


3.70 


3.45 


409518 


BE384836 


Hs.3454 


KIAA1821 protein 


3.69 


3.69 


430024 


AI808780 


Hs.227730 


integrin, alpha 6 


3.69 


3.69 


426088 


AF038007 


Hs.166196 


ATPase, Class I, type 8B, member 1 


3.68 


3.68 


418478 


U38945 


Hs.1174 


cyclin-dependent kinase inhibitor 2A (me 


3.68 


3.66 


414761 


AU077228 


Hs.77256 


enhancer of zeste (Drosophila) homolog 2 


3.67 


3.67 


413670 


AB000115 


Hs.75470 


hypothetical protein, expressed in osteo 


3.67 


3.41 


424840 


D79987 


Hs.153479 


extra spindle poles, S. cerevisiae, homo 


3.67 


3.88 


434263 


N34895 


Hs.44648 


ESTs 


3.65 


3.65 


438280 


AW015534 


Hs.217493 


annexin A2 


3.63 


3.36 


443426 


AF098158 


Hs.9329 


chromosome 20 open reading frame 1 


3.63 


3.68 


408989 


AW361666 


Hs.49500 


KIAA0746 protein 


3.61 


3.66 


416640 


BE262478 


Hs.79404 


neuron-specific protein 


3.60 


4.22 


416926 


K03109 


Hs.108920 


HT018 protein 


3.59 


3.59 


414368 


W70171 


Hs.75939 


uridine monophosphate kinase 


3.59 


3.53 


402727 






NM_025065:Homo sapiens hypothetical prot 


3.58 


3.58 


419381 


AB023420 


Hs.90093 


heat shock 70kD protein 4 


3.56 


3.77 


416114 


AI695549 


Hs.183868 


glucuronidase, beta 


3.55 


3.55 


424941 


AA128376 


Hs.153884 


ATP binding protein associated with cell 


3.55 


3.55 


431958 


X63629 


Hs.2877 


cadherin 3, type 1, P-cadherin (placenta 


3.54 


4.63 


429238 


NM_002849 


Hs.198288 


protein tyrosine phosphatase, receptor t 


3.53 


3.92 


420159 


AI572490 


Hs.99765 


Homo sapiens cDNA: FLJ21245 fis, clone C 


3.51 


5.77 


400289 


X07820 


Hs.2258 


matrix metalloproteinase 10 (stromelysin 


3.50 


3.50 


418203 


X54942 


Hs.83758 


CDC28 protein kinase 2 


3.50 


3.50 
415220 


AA431880 


Hs.181174 


ESTs, Weakly similar to T19201 hypotheti 


3.50 


3.50 


428371 


AB012193 


Hs.183874 


cullin 4A 


3.46 


3.46 


418663 


AK001100 


Hs.41690 


desmocollin 3 


3.45 


4.74 


404977 






insulin-like growth factor 2 (somatomedi 


3.45 


3.89 


422663 


AW500087 


Hs.119014 


zinc finger protein 175 


3.44 


3.44 


434061 


AW024973 


Hs.283675 


NPD009 protein 


3.41 


5.64 


418113 


AI272141 


Hs.83484 


SRY (sex determining region Y)-box 4 


3.41 


4.32 


431689 


AA305686 


Hs.267695 


UDP-Gal:betaGlcNAc beta 1,3-galactosyltr 


3.40 


3.40 


411943 


BE502436 


Hs.7962 


ESTs, Weakly similar to S44608 C02F5.6 p 


3.39 


4.27 


420005 


AW271106 


Hs.133294 


ESTs 


3.38 


3.40 


453450 


AW797627 


Hs.347459 


ADP-ribosylation factor 6 


3.38 


3.87 


410315 


AI638871 


Hs.17625 


Homo sapiens cDNA: FLJ22524 fis, clone H 


3.36 


3.36 


428839 


AI767756 


Hs.82302 


Homo sapiens cDNA FLJ14814 fis, clone NT 


3.35 


3.35 


437469 


AW753112 


Hs.15514 


hypothetical protein MGC3260 


3.35 


3.35 


407151 


H25836 


Hs.301527 


ESTs, Moderately similar to unknown [H.s 


3.34 


3.34 


428157 


AI738719 


Hs.198427 


hexokinase 2 


3.33 


3.73 


450293 


N36754 


Hs.171118 


hypothetical protein FLJ00026 


3.33 


3.33 


400750 






Target Exon 


3.33 


3.33 


450139 


AK001838 


Hs.296323 


serum/glucocorticoid regulated kinase 


3.33 


3.33 


412636 


NM_004415 




desmoplakin (DPI, DPII) 


3.30 


4.81 


447578 


AA912347 


Hs.136585 


ESTs, Weakly similar to JC5314 CDC28/cdc 


3.27 


3.38 


430315 


NM_004293 


Hs.239147 


guanine deaminase 


3.26 


4.30 


421594 


R45689 


Hs.21889 


Homo sapiens cDNA FLJ12976 fis, clone NT 


3.26 


3.41 


443030 


R68048 


Hs.9238 


hypothetical protein FLJ23516 


3.19 


3.34 


436911 


AA142984 


Hs.5344 


adaptor-related protein complex 1, gamma 


3.17 


3.40 


440006 


AK000517 


Hs.6844 


hypothetical protein FLJ20510 


3.06 


3.52 


443171 


BE281128 


Hs.9030 


TONDU 


3.05 


3.83 


429343 


AK000785 


Hs.199480 


Homo sapiens, Similar to epsin 3, clone 


3.01 


3.53 


408380 


AF123050 


Hs.44532 


diubiquitin 


2.99 


4.11 


421508 


NM_004833 


Hs.105115 


absent in melanoma 2 


2.99 


3.67 


439750 


AL359053 


Hs.57664 


Homo sapiens mRNA full length insert cDN 


2.97 


4.55 


452046 


AB018345 


Hs.27657 


KIAA0802 protein 


2.95 


3.39 


451940 


AI735759 


Hs.52620 


integrin, beta 8 


2.93 


3.58 


407722 


BE252241 


Hs.38041 


pyridoxal (pyridoxine, vitamin B6) kinas 


2.90 


3.68 


422282 


AF019225 


Hs.114309 


apolipoprotein L 


2.89 


3.57 


402230 






Target Exon 


2.88 


5.36 


406685 


M18728 




gb:Human nonspecific crossreacting antig 


2.80 


5.80 


417880 


BE241595 


Hs.82848 


selectin L (lymphocyte adhesion molecule 


2.79 


3.89 


447957 


NM_014821 


Hs.20126 


KIAA0317 gene product 


2.75 


3.45 


418004 


U37519 


Hs.87539 


aldehyde dehydrogenase 3 family, member 


2.75 


3.46 


417275 


X63578 


Hs.295449 


parvalbumin 


2.73 


3.54 


431211 


M86849 


Hs.323733 


gap junction protein, beta 2, 26kD (conn 


2.72 


8.39 


401781 






Target Exon 


2.62 


4.15 


407242 


M18728 




gb:Human nonspecific crossreacting antig 


2.54 


5.96 


428423 


AU076517 


Hs.184276 


solute carrier family 9 (sodium/hydrogen 


2.52 


4.27 


430200 


BE613337 


Hs.234896 


geminin 


2.52 


4.19 


451035 


AU076785 


Hs.430 


plastin 1 (I isoform) 


2.51 


4.15 


443162 


T49951 


Hs.9029 


DKFZP434G032 protein 


2.48 


3.66 


441495 


AW294603 


Hs.127039 


ESTs 


2.45 


3.60 


449246 


AW411209 


Hs.23363 


hypothetical protein FLJ10983 


2.45 


3.52 


401780 






NM_005557*:Homo sapiens keratin 16 (foca 


2.22 


4.49 


417079 


U65590 


Hs.81134 


interleukin 1 receptor antagonist 


2.20 


3.55 


422168 


AA586894 


Hs.112408 


S100 calcium-binding protein A7 (psorias 


2.15 


6.08 


439394 


AA149250 


Hs.56105 


ESTs 


2.05 


3.95 


427315 


AA179949 


Hs.175563 


Homo sapiens mRNA; cDNA DKFZp564N0763 (f 


1.79 


3.68 



TABLE 6B 

Pkey: Unique Eos probeset identifier number 
CAT number: Gene cluster number 
Accession: Genbank accession numbers 
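
A cluster table laid out this way (a Pkey, a CAT number, then a run of accessions that may wrap onto following lines) can be read into a lookup structure. The sketch below is only an illustration: it assumes every row starts with a six-digit Pkey followed by a CAT number such as `3672.1`, and that wrapped lines contain accessions only. The function name and the regular expression are hypothetical.

```python
import re

# A row starts with a six-digit Pkey, then a CAT number like "3672.1" or
# "42361_1", then the first accessions of the cluster (assumed layout).
ROW_START = re.compile(r"^(\d{6})\s+(\d+[._]\d+)\s+(.*)$")


def parse_cluster_table(lines):
    """Map each Pkey to its CAT number and list of Genbank accessions."""
    table = {}
    current_pkey = None
    for raw_line in lines:
        line = raw_line.strip()
        if not line:
            continue  # blank lines separate or pad rows
        match = ROW_START.match(line)
        if match:
            current_pkey = match.group(1)
            table[current_pkey] = {
                "cat": match.group(2),
                "accessions": match.group(3).split(),
            }
        elif current_pkey is not None:
            # Continuation line: more accessions for the current Pkey.
            table[current_pkey]["accessions"].extend(line.split())
    return table


sample_lines = [
    "433470 3672.1 AW960564 AA092457 T55890",
    "",
    "AA304671 AW583735",
    "438091 44964.1 AW373062 T55662",
]
clusters = parse_cluster_table(sample_lines)
print(clusters["433470"]["accessions"])
```

Rows whose Pkey or CAT number was damaged in extraction would be appended to the preceding cluster by this sketch, so its output should be checked against the printed table.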



Pkey CAT Number Accession 

412636 13165_1 NM_004415 AL031058 M77830 BE149760 AW752599 AW848723 AW376697 AW376817 AW376699 AW848371 AW376782 AW848789 
AW361413 AW849074 AW997139 AW799304 AW799309 BE077020 BE077017 BE185187 AW997196 BE156621 BE179915 BE006561 
BE143155 AW890985 BE002107 AW103521 AA857316 AW383133 BE011378 AW170253 BE185750 AW886475 BE160433 J0521 1 BE082576 
BE082584 BE004047 AW607238 AW377700 AW377699 BE082526 BE082505 BE082507 BE082514 AW178000 AW177933 A1905935 
AW747877 AW7481 14 BE148516 AW265328 AW847678 AW847688 AW365151 AW365148 AW365153 AW365156 AW365175 AW365157 
AW365154 AW068B40 BE005272 AW365145 BE001925 BE182166 BE144243 BE001 923 AI95 1766 A1434518 BE184920 BE1 84933 Af 284090 
BE184941 AW804674 BE184924 C04715 W39488 AW995615 BE184948 BE159646 AW606653 AA099891 AA131 128 AA337270 AA340777 
AW384371 AA852212 R58704 AW366566 AW364859 AA025851 AA025852 AA455100 AA719958 AW352220 AW996245 BE165351 8E073467 
AA377127 AW890264 AW609750 AW391912 AW849690T87267 AW853812 AA852213 W74149 BE0O9090 AA056401 H9101 1 AW368529 
AW390272 C18467 AW674920 N57176 AA026480 AW576767 H93284 AA026863 AW177787 AA026654 AW1 77786 BE092134 BE092137 
BE092136 AW177784 AI022862 BE091653 AW37681 1 AW848592 AA040018 BE185331 BE182164 AA368564 AW951576 T29918 AA131077 
W95048 W25458 AW205789 H90899 N29754 W32490 R20904 BE167181 BE167165 N84767 H27408 H30146 AI190590 C03378 AI554403 
AI205263 AA128470 AJ392926 AF139065 AW370813 AW370827 AW798417 AW798780 AW79B883 AW798569 R33557 AA149190 C03029 
AW177783 AA088866 AW370829 M247685 BE002273 AI760816 AI439101 AW879451 AI700963 AA451923 AI340326 AI590975 T48793 
A1558096AI142882 AA039975 AI470146 AA94S936 BE067737 BE0677B6W19287 AA644381 AA702424 AI417612 A130S554 AI586869 
AI558892 AW190555 AI571075 AI220573 AA056527 A1471874 A1304772 AW517828 AI915596 AI627383 A1270345 AW021 347 AW166807 
AW1 05814 AI346078 AA552300 W95070 AI494069 A191 1702 AA149191 AA026864 AI830049 AI887258 AW780435 A1910434 AI819984 
AI858282 AI078449 AI025932 AI860584 AI635878 AA026047 AA703232 012062 AW192085 AA558154 AW514597 AW591892 T87181 
AA782066 AW243315 AW150038 AW28B383 AW004633 AI927207 AA782109 AW473233 A1804485 AW1 6921 6 A) 572669 AA602182 
AW01 5480 AW771865 AI270027 AA96181 6 AA283207 AI076952 Af498487 AI348053 AI783914 H44405 AW7991 18 AA128330 AA515500 
AA918281 W02156 A1905927 AA022701 W38382 R20795 T77861 AW860878 

433470 3672.1 AW960564 AA092457 T55890 056120 T92525 AI815987 BE 182608 BE182595 AW080238 M90657 AA347236 AW961686 AW1 76446 

AA304671 AW583735 T61714 AA316968AI446615AA343532 AA083489AA488005 W52095 W39480 N57402 D82638 W25540 W52847 
D82729 D58990 BE619182 AA315188 AA308636 M112474 W76162 AA088544 H52265 AA301631 H80982 AA113786 BE620997 AW651691 
AA343799 BE613669 BE547180 BE546656 F11933 AA376800 AW239185 AA376086 BE544387 BE619041 AA452515 AA001806 AA190873 
AA180483 AA159546 F00242 AI940609 AI940602 Ah 89753 T97663 T661 10 AW062896 AW06291 0 AWO62902 AI051622 AI828930 AA102452 
AI685095 AI819390 AA557597 AA383220 A1804422 A1633575 AW338147 AW603423 AW606800 AW750567 AW510672 AI250777 AA083510 
AW529109 AW513200 AA921353 AI677934 AI148698 AI955858 AA173825 AA453027 AI027865 AW375542 AA454099 AA733014 AI591384 
R79300 R80023 AA843108 AA62605B AA844898 AW375550 AA88901 8 M474275 AW205937 A105227Q AW3881 17 AW3881 1 1 AA699452 
AI242230 N47476 H38178 AA366521 AA1 13196 AA130023 H39740 T61629 AI885973 AW083671 AA179730 AA305757 AI285455 N83956 
AA216013 AA3361 55 AW999959 T97525 AA345349 T91 762 AA771981 AI285092 AI591385 BE3924B6 BE385852 AA682601 AI682884 
AA345840 T85477 AA292949 AA932079 AA098791 D82607 T48574 AW752038 C06300 

438091 44964.1 AW373062 T55662 A1299190 BE174210 AW5790O1 H01B1 1 W40186 R67100 AI923886 AW952164 AA628440 AW898607 AW898616 

AA709126 AW898628 AW898544 AA947932 AW898625 AW898622 AI276125 Al 1 85720 AW51 0698 AA987230 T52522 BE467708 AW243400 
AW043642 AI288245 AJ1 86932 D52654 D55017 D52715 D52477 D53933 D54679 AI298739 A1146984 AI922204 N98343 BE174213 AA845571 
AI813854 AI214518 AJ635262 AI139455 A)707807 AJ698085 AW884528 AJ024768 AJ004723 AW087420 AI565133 N94964 AI268S39 
AW51 3280 AI061 126 AJ435818 A1859106 AI350506 AI024767 AA513019 AA757598 X56196 AA902959 AI334784 M860794 AA010207 
AW890091 AW51 3771 AI951 391 AI337671 T52499 AA890205 AI640908 H75966 AA453487 AA358688 AI961767 A1866295 AA780994 
A198591 3 BE174196 AA029094 AW5921 59 T55581 N79072 AI61 1201 AA910812 AI220713 AW149306 A1758412 AA045713 R79750 N76096 

TABLE 6C 



Pkey: Unique number corresponding to an Eos probeset 

Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (GI) numbers. "Dunham I. et al." refers to the publication entitled "The DNA sequence of human chromosome 22." Dunham I. et al., Nature (1999) 402:489-495. 
Strand: Indicates DNA strand from which exons were predicted. 
Nt_position: Indicates nucleotide positions of predicted exons. 
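
The exon position strings in the last column of this table are comma-separated `start-end` ranges. A minimal sketch of reading one such string follows; it assumes that both coordinates are inclusive, which the legend does not state, and the function names are hypothetical.

```python
def parse_exon_positions(text):
    """Turn a string like '198991-199168,199316-199548' into (start, end) pairs."""
    exons = []
    for part in text.split(","):
        start_text, end_text = part.strip().split("-")
        start, end = int(start_text), int(end_text)
        if start > end:
            raise ValueError(f"exon start {start} is after end {end}")
        exons.append((start, end))
    return exons


def total_exon_length(exons):
    """Sum exon lengths, treating both coordinates as inclusive (an assumption)."""
    return sum(end - start + 1 for start, end in exons)


# Position string listed for Pkey 400750 in the table that follows.
exons = parse_exon_positions("198991-199168,199316-199548")
print(exons, total_exon_length(exons))
```

A string that fails to parse here usually points to extraction damage in the row, such as a comma read as a period or a digit replaced by a symbol.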



Pkey 


Ref 


Strand 


Nt_position 


400750 


8119067 


Plus 


198991-199168,199316-199548 


400843 


9188605 


Plus 


5863-5970,7653-7784,8892-9023,9673-9807,10634-10789,15254-15403,23827-23958 


401093 


6516137 


Minus 


22335-23166 


401192 


9719502 


Minus 


69559-70101 


401451 


6634068 


Minus 


119926-121272 


401780 


7249190 


Minus 


28397-28617,28920-29045,29135-29296,29411-29567,29705-29787,30224-30573 


401781 


7249190 


Minus 


83215-83435,83531-83656,83740-83901,84237-84393,84955-85037,86290-86814 


402230 


9966312 


Minus 


29782-29932 


402727 


9211324 


Plus 


54595-54777 


404875 


9801324 


Plus 


96588-96732,97722-97831 


404977 


3738341 


Minus 


43081-43229 


405033 


7107731 


Minus 


142358-142546 


405506 


6466489 


Plus 


80014-80401,80593-81125 
TABLE 7A: Genes downregulated in bladder cancer 

Pkey: Unique Eos probeset identifier number 

ExAccn: Exemplar Accession number, Genbank accession number 

UnigeneID: Unigene number 

Unigene Title: Unigene gene title 

R1: 90th percentile of normal urothelium biopsy AIs divided by the 75th percentile of bladder tumor AIs 

R2: 90th percentile of normal urothelium biopsy and normal bladder AIs divided by the 90th percentile of bladder tumor AIs 



Pkey 


ExAccn 


UnigeneID 


Unigene Title 


R1 


R2 


403010 




C21000152:gi|6226483|sp|G5211B|YMO3_ERWS 


4.86 


149 


426798 


S78234 


Hs.172405 


cell division cycle 27 


4.28 


2.48 


416225 


AA577730 


Hs.188684 


ESTs, Weakly similar to PC4259 ferritin 


4.04 


107 


459006 


AW298631 


Hs.27721 


Wolf-Hirschhorn syndrome candidate 1-lik 


3.82 


166 


404917 






Target Exon 


3.78 


2.00 


426488 


X03350 


Hs.4 


alcohol dehydrogenase 1B (class I), beta 


2.64 


1.79 


419543 


AA244170 




gb:nc05h02.s1 NCLCGAPJM Homo sapiens 


2.63 


3.42 


453180 


N46243 


Hs.110373 


ESTs, Highly similar to T42626 secreted 


2.32 


3.24 


428957 


NM_003881 


Hs.194679 


WNT1 inducible signaling pathway protein 


120 


3.60 


451529 


AI917901 


Hs.208541 


ESTs 


2.18 


3.69 


417076 


AW973454 


Hs.238442 


ESTs, Moderately similar to ALU7_HUMAN A 


103 


3.03 


425438 


T62216 


Hs.270840 


ESTs 


2.00 


5.17 


450515 


AW304226 




biphenyl hydrolase-like (serine hydrolas 


1.89 


2.46 


432873 


AW837268 


Hs.279639 


Homo sapiens mRNA; cDNA DKFZp586M2022 (f 


1.70 


2.79 


452123 


AI267615 


Hs.38022 


ESTs 


1.69 


2.46 


424378 


W28020 


Hs.167988 


neural cell adhesion molecule 1 


1.65 


4.67 


437601 


AA761546 


Hs.248844 


ESTs, Weakly similar to ALU1_HUMAN ALU S 


1.50 


3.34 


402096 






ENSP00000217725*:Laminin alpha-1 chain p 


1.48 


3.02 


439563 


AI018768 


Hs.12482 


glyceronephosphate O-acyltransferase 


1.47 


3.22 


412810 


M21574 


Hs.74615 


platelet-derived growth factor receptor, 


1.46 


2.30 


458651 


AW612481 


Hs.104105 


ESTs 


1.39 


2.89 


414033 


AL079707 


Hs.207443 


hypothetical protein MGC10848 


1.36 


180 


433572 


AL046859 


Hs.3407 


protein kinase (cAMP-dependent, catalyti 


1.35 


3.49 


413305 


NM_000426 


Hs.323511 


Homo sapiens cDNA: FLJ23176 fis, clone L 


1.34 


2.93 


420412 


AW976674 


Hs.125103 


ESTs 


1.32 


5.13 


421406 


AF179897 


Hs.104105 


Meis (mouse) homolog 2 


1.31 


4.07 


417446 


AL118671 


Hs.82163 


monoamine oxidase B 


1.27 


186 


452885 


AI478250 


Hs.13751 


ESTs 


1.26 


1.95 


446808 


AA703226 


Hs.16193 


Homo sapiens mRNA; cDNA DKFZp586B211 (fr


1.25 


3.44 


443105 


X96753 


Hs.9004 


chondroitin sulfate proteoglycan 4 (mela 


1.24 


2.07 


421348 


M94048 


Hs.103724 


peripheral myelin protein 22 


1.24 


2.63 


433070 


N75346 


Hs.306121 


CDC20 (cell division cycle 20, S. cerevi


1.23 


180 


420059 


AF161486 


Hs.94769 


RAB23, member RAS oncogene family 


1.22 


3.43 


408491 


AI088063 


Hs.7882 


ESTs 


1.20 


6.01 


447384 


AI377221 


Hs.40528 


ESTs 


1.00 


7.92 


421998 


R74441 


Hs.117176 


poly(A)-binding protein, nuclear 1


1.00 


7.38 


409619 


AK001015 


Hs.55220 


BCL2-associated athanogene 2


1.00 


6.40 


444795 


AI193356 


Hs.160316 


ESTs 


1.00 


5.53 


408495 


W68796 


Hs.237731 


ESTs 


1.00 


5.05 


417124 


BE122762 


Hs.25338 


ESTs 


1.00 


4.73 


443998 


AI620661 


Hs.296276 


ESTs 


1.00 


4.39 


406303 






C16000922:gi|7499103|pir||T20903 hypothe


1.00 


4.37 


422994 


AW891802 


Hs.296276 


ESTs 


1.00 


4.37 


422195 


AB007903 


Hs.113082


KIAA0443 gene product 


1.00 


4.35 


452877 


AI250789


Hs.32478 


ESTs 


1.00 


3.90 


452487 


AW207659 


Hs.6630 


Homo sapiens cDNA FLJ13329 fis, clone OV


1.00 


3.90 


417159 


R01761 




gb:ye81f10.s1 Soares fetal liver spleen


1.00 


3.82 


445607 


AA488107 


Hs.30156 


ESTs, Weakly similar to unnamed protein 


1.00 


3.62 


406274 






Target Exon 


1.00 


3.59 


410611 


AW954134 


*Hs.20924 


KIAA1628 protein


1.00 


3.06 


426495 


NM_001151


Hs.2043 


solute carrier family 25 (mitochondrial 


1.00 


2.89 


422292 


AI815733


Hs.114360 


transforming growth factor beta-stimulat 


1.00 


2.61 


413040 


AA193338 


Hs.12321 


sodium calcium exchanger


1.00 


2.51 


429623 


NM_005308


Hs.211569 


G protein-coupled receptor kinase 5 


1.00 


105 


456607 


AI660190 


Hs.106070 


cyclin-dependent kinase inhibitor 1C (p5


1.00 


2.01 


429143 


AA333327 


Hs.197335 


plasma glutamate carboxypeptidase 


0.97 


145 


400288 


X06256 


Hs.149609 


integrin, alpha 5 (fibronectin receptor,


0.90 


2.47 


442498 


U54617 


Hs.8364 


Homo sapiens pyruvate dehydrogenase kina 


0.88 


5.08 


414449 


AA557660 


Hs.76152 


decorin 


0.88 


3.13 


412014 


AI620650


Hs.43761 


ESTs, Weakly similar to A46010 X-linked


0.78 


1.88 


425100 


AF051850


Hs.154567 


supervillin


0.70 


3.90 


432094 


AI658580


Hs.61426 


Homo sapiens mesenchymal stem cell prote 


0.68 


3.41 


427818 


AW511222 


Hs.193765 


ESTs 


0.63 


3.75 
Pkey: Unique Eos probeset identifier number
CAT number: Gene cluster number
Accession: Genbank accession numbers

Pkey CAT Number Accession

417159 1653899J R01761 R01760 N49787 

419543 185745.1 AA244170 AI0180B7 AA244355

450515 83710J AW304226 AW008420 AA349212 H15015 AA317021 AI829484 H25661 H81744 AI906147 AA837938 AW167766 AW603578 AW842369
BE439926 AA902417 AW235409 AA010062 AW069319 AI280242 AW672925 H06848 H05608 R51905 R45023 AW675471 H28475 AI086597
AI197815 AI825355 N99134 AI075956 AI470122 AA449985 AW662833 AAB6M23 AA913342 T23825 AI394207 AI310319 T32467 AI589870
AI682293 AI810633 BE223045 H14620 AA626645 AA875023 T33571 AA953982 AM 38631 H15016 AI304356 AA983S31 AI350990 AI143993
AI708171 AA525961 H26247 W38485 AA847598 H81745 AW855486 BE299605 AI079409 AI278050 AI223168 AI860904 AW025415 AJ339003
AA393692 AI354302 AI492838 N80194 AI015651 N34543 BE295397 AI085154 W24135



TABLE 7C 



Pkey: Unique number corresponding to an Eos probeset
Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (GI) numbers. "Dunham I. et al." refers to the publication entitled "The DNA sequence of human chromosome 22." Dunham I. et al., Nature (1999) 402:489-495.
Strand: Indicates DNA strand from which exons were predicted.
Nt_position: Indicates nucleotide positions of predicted exons.



Pkey     Ref       Strand   Nt_position
402096   8117697   Minus    24993-25186
403010   3132346   Plus     78385-79052
404917   7341851   Plus     49330-49498
406274   7543787   Plus     932-1123
406303   8575868   Plus     173622-173786

TABLE 8A: Genes predictive of bladder cancer progression

Pkey: Unique Eos probeset identifier number

ExAccn: Exemplar Accession number, Genbank accession number

UnigeneID: Unigene number

Unigene Title: Unigene gene title

R1: 80th percentile of Ta or T1 tumor AIs from patients who upstaged divided by 80th percentile of Ta or T1 tumor AIs from patients who did not upstage
R2: median of Ta or T1 tumor AIs from patients who upstaged divided by the median of Ta or T1 tumor AIs from patients who did not upstage
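
The two ratios defined in this legend can be illustrated with a short sketch. It is only an illustration under stated assumptions, not the procedure actually used to generate the table: the AI values are taken to be per-sample intensity values for a single probeset, the function names `percentile` and `progression_ratios` are hypothetical, and linear interpolation between ranks is assumed because the legend does not state how the percentiles were computed.

```python
from statistics import median


def percentile(values, pct):
    """Percentile with linear interpolation between ranks (an assumed convention)."""
    ordered = sorted(values)
    if not ordered:
        raise ValueError("at least one value is required")
    rank = (len(ordered) - 1) * pct / 100.0
    lower = int(rank)
    upper = min(lower + 1, len(ordered) - 1)
    fraction = rank - lower
    return ordered[lower] + (ordered[upper] - ordered[lower]) * fraction


def progression_ratios(upstaged_ais, not_upstaged_ais):
    """Return (R1, R2) for one probeset, following the legend's definitions.

    R1: 80th percentile of the upstaged group over that of the non-upstaged group.
    R2: median of the upstaged group over the median of the non-upstaged group.
    """
    r1 = percentile(upstaged_ais, 80) / percentile(not_upstaged_ais, 80)
    r2 = median(upstaged_ais) / median(not_upstaged_ais)
    return r1, r2


if __name__ == "__main__":
    # Made-up intensities, used only to show the call shape.
    upstaged = [100, 200, 300, 400, 500]
    not_upstaged = [50, 100, 150, 200, 250]
    print(progression_ratios(upstaged, not_upstaged))
```

Under this reading, a ratio above 1 means the probeset signal is higher in Ta or T1 tumors from patients who later upstaged than in those who did not.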



Pkey 


ExAccn 


UnigeneID


Unigene Title 


R1 


R2 


413324 


V00571 


Hs.75294 


corticotropin releasing hormone 


8.30 


4.16 


437802 


AI475995 


Hs.122910 


ESTs 


7.51 


2.50 


444444 


AI149332


Hs.14855 


ESTs 


2.58 


1.38 


445033 


AV652402 


Hs.72901 


mucin 13, epithelial transmembrane 


2.26


1.13 


417771 


AA804698 


Hs.82547 


retinoic acid receptor responder (tazaro


3.27 


5.33 


449618 


AI076459


Hs.15978 


KIAA1272 protein 


2.70 


3.33 


407242 


M18728 




gb:Human nonspecific crossreacting antig


3.58 


1.90 


418318 


U47732 


Hs.84072 


transmembrane 4 superfamily member 3 


3.32 


1.00 


423441 


R66649 


Hs.278359 


absent in melanoma 1 like 


2.44 


2.40 


405033 






C1002652*:gi|544327|sp|Q04799|FMO5_RABIT


1.75 


3.48 


423024 


AA593731 


Hs.325823 


ESTs, Moderately similar to ALU5_HUMAN A


3.72 


1.28 


425118 


AU076611 


Hs.154672 


methylene tetrahydrofolate dehydrogenase 


2.40 


2.78 


437928 


NM_005476


Hs.5920 


UDP-N-acetylglucosamine-2-epimerase/N-ac


2.20 


1.53


446584 


U53445 


Hs.15432 


downregulated in ovarian cancer 1 


1.38 


2.33 


436608 


AA628980 




down syndrome critical region protein DS 


3.32 


4.53 


404440 






NM_021048:Homo sapiens melanoma antigen, 


2.64 


1.00 


435602 


AF217515 


Hs.283532 


uncharacterized bone marrow protein BM03 


2.06


1.09 


424098 


AF077374 


Hs.139322 


small proline-rich protein 3 


2.47 


3.64 


437553 


AI829935


Hs.130497


ESTs, Weakly similar to MAT8_HUMAN CHLOR


2.09 


0.91 


428036 


AW068302 




Homo sapiens mRNA for caldesmon, 3' UTR


2.71 


4.58 


446839 


BE091926 


Hs.16244 


mitotic spindle coiled-coil related prot


3.18 


2.33 


448479 


H96115 


Hs.21293 


UDP-N-acteylglucosamine pyrophosphorylas


2.61 


1.81 


412059 


AA317962 


Hs.249721 


ESTs, Moderately similar to PC4259 ferri 


1.90 


2.02 


401241 


AB028989 




mitogen-activated protein kinase 8 inter 


1.26 


2.55 


408741 


M73720 


Hs.646 


carboxypeptidase A3 (mast cell) 


2.12 


1.23 


415989 


AI267700 




ESTs 


1.60 


1.00 


431070 


AW408164 


Hs.249184 


transcription factor 19 (SC1)


1.73 


1.50 


452140 


AB007928 


Hs.28169


KIAA0459 protein


2.44 


2.95 


443162 


T49951 


Hs.9029 


DKFZP434G032 protein 


2.98 


1.00 


435904 


AF261655 


Hs.8910 


1,2-alpha-mannosidase IC


2.56 


1.20 


407379 


AA332127 


Hs.325804 


transcription factor 17 


2.10 


1.72 


442712 


BE465168 


Hs.131011 


ESTs 


2.54 


2.72 


411678 


AI907114


Hs.71465 


squalene epoxidase 


1.12 


3.11 


406791 


AI220684 


Hs.347939 


hemoglobin, alpha 2 


1.69 


1.38 


431805 


NM_014053


Hs.270594


FLVCR protein 


1.92 


2.05 


438414 


AA806794 


Hs.131511 


ESTs 


1.04 


2.15 


413924 


AL119964


Hs.75616 


seladin-1 


1.69 


2.05 


437679 


NM_014214


Hs.5753 


inositol(myo)-1 (or 4)-monophosphatase 2


2.27 


2.26 


445911 


AI985987


Hs.145645 


ESTs, Moderately similar to ALU1_HUMAN A 


1.42 


2.74 


408349 


BE546947 


Hs.44276 


homeo box C10 


1.60 


2.05 


422545 


X02761 


Hs.287820 


fibronectin 1


1.77 


3.02 


406643 


N77976 


Hs.347939 


hemoglobin, alpha 2 


1.57 


1.35 


407228 


M25079 


Hs.155376 


hemoglobin, beta 


1.81 


1.50 


449644 


AW960707 


Hs.148324 


ESTs 


1.90 


3.19 


402305 






C19000735*:gi|4508027|ref|NP_003414.1|z


2.25 


1.49 


427683 


BE545490 


Hs.15053 


Homo sapiens HCMOGT-1 mRNA for sperm ant 


1.08 


2.25 


441690 


R81733 


Hs.33106 


ESTs 


1.80 


2.65 


434487 


AF143887 


Hs.337588 


ESTs, Moderately similar to S65657 alpha 


2.13 


1.63 


403362 






NM_001615*:Homo sapiens actin, gamma 2, 


2.33 


2.22 


445496 


AB007860 


Hs.12802 


development and differentiation enhancin 


1.12 


2.60 


425234 


AW152225 


Hs.165909


ESTs, Weakly similar to I38022 hypotheti


1.83 


3.05 


402366 


AV648601 




apolipoprotein B (including Ag(x) antige


1.32 


2.05 


427254 


AL121523 


Hs.97774 


ESTs 


244 


1.00 


414533 


AA149060 


Hs.296100 


ESTs 


1.70 


2.06 


430157 


BE348706 


Hs.278543 


ESTs 


2.54 


3.00 


413433 


NM_003199


Hs.326198 


transcription factor 4 


2.26 


1.41 


410532 


T53088 


Hs.155376 


hemoglobin, beta


1.52


1.56


405779






NM_005367:Homo sapiens melanoma antigen,


2.42


1.22 


450455 


AL117424 


Hs.25035 


chloride intracellular channel 4 


2.48 


2.13 


414081 


AW969976 


Hs.279009 


matrix Gla protein


1.81 


1.53 


414761 


AU077228 


Hs.77256 


enhancer of zeste (Drosophila) homolog 2


1.65 


2.08 


415062 


H45100 


Hs.49753 


uveal autoantigen with coiled coil domai 


1.62 


3.75 


406317 






C2002658*:gi|6625694|gb|AAF19354.1|AF185


1.68 


2.11 


453259 


R93125 


Hs.124187 


ESTs 


1.08 


2.25 


445937 


AI452943


Hs.321231 


UDP-Gal:betaGlcNAc beta 1,4- galactosylt 


1.76 


1.01 


434370 


AF130988 


Hs.58346 


ectodysplasin 1, anhidrotic receptor 


1.12 


2.08 


418058


AW161552 


Hs.83381 


guanine nucleotide binding protein 11 


2.02 


2.40



432331


W37862


Hs.274368 


MSTP032 protein 


4.36 


218 


45173S 


AW080356 


Hs.23889 


ESTs, Weakly similar to ALU7_HUMAN ALU S 


1.51 


3.45 


413109


AW389845 


Hs.110855


ESTs 


4.34 


5.98 


413643 


AA130987


Hs.188727 


ESTs 


1.30 


242 


433217 


AB040914 


Hs.278628 


KIAA1481 protein


1.70 


249 


435232 


NM_001262


Hs.4854 


cyclin-dependent kinase inhibitor 2C (p1


1.79 


1.66 


438006 


BE148799 


Hs.127951 


hypothetical protein FLJ14503


1.76 


204 


410102 


AW248508 


Hs.279727 


ESTs; homologue of PEM-3 (Ciona savignyi


232 


245 


433556 


AW974941 


Hs.292385 


ESTs, Weakly similar to I78885 serines 


1.14 


250 


415714 


NM_002290


Hs.78672 


laminin, alpha 4


2.52 


1.13 


450282 


AA007655 


Hs.93523 


ESTs 


1.40 


258 


442855 


AI074465 


Hs.133469


ESTs 


1.54 


2.20 


432917 


NM_014125 


Hs.241517 


PRO0327 protein 


2.24


3.03 


429041 


AJ132820 


Hs.194768


a disintegrin and metalloproteinase doma


1.61 


260 


442807 


AL049274 


Hs.8736 


Homo sapiens mRNA; cDNA DKFZp564H203 (fr


1.73 


1.19 


427719 


AI393122 


Hs.134726 


ESTs 


1.46 


200 


408778 


AI500519 


Hs.63382 


hypothetical protein PRO2714


1.46 


258 


418870 


AF147204 


Hs.89414 


chemokine (C-X-C motif), receptor 4 (fus


1.40 


203 


424425 


AB031480 


Hs.146824 


SPR1 protein 


1.60 


1.11 


445391 


T92576 


Hs.191168 


ESTs 


1.69 


240 


446899 


NM_005397


Hs.16426 


podocalyxin-like


1.22 


242 


420996 


AK001927 


Hs.100895 


hypothetical protein FLJ10462


2.68 


298 


424909 


S78187 


Hs.153752 


cell division cycle 25B


2.18 


1.33 


413593 


AA205248 




gb:zq78c12.r1 Stratagene hNT neuron (937


1.69 


200 


408734 


AW264996 


Hs.254299


ESTs 


106 


294 


413880 


AI660842


Hs.110915


interleukin 22 receptor 


1.24 


220 


437063 


AA351109 


Hs.5437 


Tax1 (human T-cell leukemia virus type 1


1.78 


260 


418044 


AI640532 


Hs.119830


ESTs, Weakly similar to ALUF_HUMAN !!!!


1.54 


253 


441971 


W27060 


Hs.265855 


ESTs 


1.62 


213 


450401 


AW959281 


Hs.8184 


ESTs 


1.42 


230 


440157 


AA868350 


Hs.343636 


ESTs 


1.38 


260 


457587 


AA992841 


Hs.27263 


KIAA1458 protein


1.47 


242 


mm 


BE256751 


Hs.22867 


Homo sapiens cDNA: FLJ22073 fis, clone H


1.18 


210 


402241 






Target Exon 


258 


252 


413428 


AA430155 


Hs.151343


KIAA1524 protein


1.22 


245 


416735 


R11275 


Hs.194485


ESTs 


1.14 


214 


421582 


AI910275




trefoil factor 1 (breast cancer, estroge 


1.25 


1.03 


431031 


AA830335 


Hs.105273


ESTs 


235 


295 


433336 


AF017986 


Hs.31386 


secreted frizzled-related protein 2 


3.72 


1.00 


420786 


AW296466 


Hs.43628 


deleted in lymphocytic leukemia, 2 


1.23 


260 


401335 






Target Exon 


1.18 


1.68 


417670 


R07785 




gb:yf15c06.r1 Soares fetal liver spleen


1.56 


200 


406314 






C14001020:gi|12597441|gb|AAG60049.1|AF31


1.60 


3.08 


458981 


AW968318 


Hs.285996 


hypothetical protein FLJ23375


1.70 


250 


417509 


AA203414 


Hs.42009 


ESTs 


1.82 


205 


452732 


BE300078 


Hs.80449 


Homo sapiens, clone IMAGE:3535294, mRNA,


1.34 


1.37 


418678 


NM_001327


Hs.87225


cancer/testis antigen (NY-ESO-1) 


1.37 


1.02 


457871 


AJ168278 




ESTs 


1.20 


219 


444163 


AM 26098 




gb:qc54g07.x1 Soares_placenta_8to9weeks -


1.43 


1.22 


413276 


Z24725 


Hs.75260 


mitogen inducible 2 


1.78 


228 


421097 


AI280112 


Hs.125232 


Homo sapiens cDNA FLJ13266 fis, clone OV


255 


260 


417151 


AA194055 


Hs.293858 


ESTs 


1.68 


1.67 


453556 


AA425414 


Hs.33287 


nuclear factor I/B


2.06 


240 


440859 


AW070865 


Hs.346390 


ESTs 


1.12 


1.70 


420629 


AW204343 


Hs.156823 


ESTs, Weakly similar to T30868 RhoA-bind 


1.21 


238 


422363 


T55979 


Hs.115474


replication factor C (activator 1) 3 (38 


1.58 


215 


434831 


AA248060 


Hs.273397 


KIAA0710 gene product 


1.69 


1.78 


412055 


AA099907 


Hs.271806 


ESTs, Weakly similar to ALU1_HUMAN ALU S


1.36 


265 


445468 


AW450439 




ESTs 


1.52 


250 


444550 


BE2507J6 


Hs.87614


ESTs 


1.30 


228 


417259 


AW9D3838 


Hs.81800 


chondroitin sulfate proteoglycan 2 (vers


1.50 


3.02 


430233 


AW367902 


Hs.236443 


Homo sapiens mRNA; cDNA DKFZp564N1063 (f 


1.24 


295 


413444 


BE141019 




gb:MRO-HT0067-201099-G02-b10 HT0067 Homo 


"1.68 


280 


433844 


AA610175 


Hs.179647 


Homo sapiens cDNA FLJ12195 fis, clone MA


1.49 


201 


427055 


AI301740 


Hs.173381 


dihydropyrimidinase-like 2


1.11 


258 


454244 


R51604 


Hs.300842 


KIAA1608 protein


1.00 


202 


429503 


AA394183 


Hs.26873 


ESTs 


258 


4.08 


422940 


BE077458 




gb:RC1-BT0606-09050(H)15-b04 BT0606 Homo 


3.48 


246 


407949 


W21874 


Hs.247057 


ESTs, Weakly similar to 2109260A B cell 


3.13 


1.48 


437312 


AA809350 


Hs.246180 


ESTs 


1.10 


205 


449426 


T92251 


Hs.198882 


ESTs 


1.22 


208 


447620 


AW290951 


Hs.224965 


ESTs 


1.80 


218 


444700 


NM_003645


Hs.11729


fatty-acid-Coenzyme A ligase, very long-


1.72 


281 


436258 


AW867491 


Hs.107125 


plasmalemma vesicle associated protein


1.99 


1.80 


415712 


AW249188 


Hs.169577


Homo sapiens cDNA FLJ14743 fis, clone NT


1.66 


298 


432877 


AW974111 


Hs.292477 


ESTs 


1.63 


279 


412085 


AW891667 




gb:CM3-NT0089-110500-179-h09 NT0089 Homo


1.40 


208 



431421


AW969118 


Hs.108144 


ESTs, Weakly similar to unnamed protein 


1.96 


3.58 


4U9714 


AW367B12 


Hs.199961


ESTs, Weakly similar to ALU7_HUMAN ALU S


1.24 




4*3U13 


AW&75443 


Hs.22209 


secreted modular calcium-binding protein 


1.49 


0 no 


422663 


AW500087 


Hs.119014 


zinc finger protein 175


1.01 




439737 


AI751438 


Hs.41271 


Homo sapiens mRNA full length insert cDN


1.54 


2.57 


413196 


AA127386




gb:zn90d09.r1 Stratagene lung carcinoma 


1.04 


2.18 


439349 


AJ660898 


Hs.195602


ESTs 


2.03


2.43 


443005 


AI027184 


Hs.200918 


ESTs 


1.42 


o 4n 


424762 


AL119442


Hs.183884


eukaryotic translation initiation factor 


2.56 


3.43 


427373 


AB007972 


Hs.130760


myosin phosphatase, target subunit 2


2.08


4 7f\ 


413916 


N49813 


Hs.75615 


apolipoproteln OH 


0.70 


U.42 


41B332 


R34976 


Hs.78293 


ESTs


2.74 


2.43


426552 


BE297660 


Hs.170328


moesin 


1.28 




ACCCQ1 
4Mb 00 


Arlr9o97 


Hs.104105 


Meis (mouse) homolog 2


A A*) 




447214 


AI367288 


Hs.273621 


Homo sapiens cDNA: FLJ21350 fis, clone L


1.14 


O 40 
4.10 


449254 


W2o9QB 


Hs.172762


to IS 






443651 


W22152 


Hs.282929 


ESTs 


2.94 


i nn 


4*1021 


AAB08018


Hs.109302


ESTs


1.41


*.U4 


419741 


NM_007019


Hs.93002 


ubiquitin carrier protein E2-C


1.61


1 01 


432027 


AL096678 


Hs.272353 


KIAA0957 protein


1.70 


2.10


452680 


AA721140


Hs.49930 


ESTs, Weakly similar to putative p loo (H 


4 on 


^.90 


417042 


C75563 


Hs.113029


ribosomal protein S25 


2.22 


3.20 


443574 


U83993 


Hs.321709 


purinergic receptor P2X, ligand-gated io


1.21 


O C1 


429372 


AA451859 


Hs.99253 


ESTs 


1.28 


2.43 


424290 


AA338396 




gb:EST43386 Fetal brain 1 Homo sapiens c 


1.82


2.03 


428518 


AW969656 




gb:EST381733 MAGE resequences, MAGK Homo


1.72 


2.52 


455649 


BE065051




gb:RC1-BT0313-1 1050XMJ17-CU4 BT0313 Homo


1.00 


3.U3 


414665 


AA160873 


Hs.332053 


serum amyloid A1 


4 *3C 

1.3b 


i no 

l.Uo 


418298 


AA256014 


Hs.86682 


Homo sapiens cDNA: FLJ21578 fis, clone 0


1.04


0 Fit 


429655 


U48959 


Hs.211582 


myosin, light polypeptide kinase 


4.94 


4.34 


433924 


AA618304 


Hs.258785


ESTs 


1.44


O /ft 


452683 


AI089575 


Hs.9071 


progesterone membrane binding protein 


1.48 


2.48 


439437 


AI207788 


Hs.343628


sialyltransferase 4B (beta-galactosidase 


1.36 


2.33 


432314 


AA533447 


Hs.312989


ESTs 


0.96 


2.78 


400881 






NM_025080:Homo sapiens hypothetical prot 


1.70 


3.15 


426477 


AA379464 




gb:EST92386 Skin tumor 1 Homo sapiens cD


2.01 


2.37 


454741 


BE154396




gb:CM2-HT0342-091299-050-005 HT0342 Homo


2.12 


3.44 


423977 


AA333232 




gb:EST37283 Embryo, 8 week i Homo sapien 


1.38 


2.13 


450396 


AU077002 


Hs.24950 


regulator of G-protein signalling 5 


2.13 


3.28 


431842 


NM_005764 


Hs.271473 


epithelial protein up-regulated in carci


1.90 


2.23 


415157 


D63257 




gb:HUM514B08B Clontech human placenta po 


1.29 


2.90 


418236 


AW994005 


Hs.337534 


ESTs 


1.74 


2.37 


454390 


AB020713 


Hs.56966 


KIAA0906 protein 


1.47 


1.38 


436143 


AA705245 


Hs.192189 


ESTs 


1.40 


2.40


436251 


BE515065 


Hs.296585 


nucleolar protein (KKE/D repeat) 


1.43 


2.07 


450735 


AI732321 




SRY (sex determining region Y)-box 4 


1.36 


2.02 


420136 


AW801090 


Hs.195851


actin, alpha 2, smooth muscle, aorta 


2.70 


1.00


447100 


AI361801 


Hs.167130


hypothetical protein 


1.66 


2.03


453577 


AL043049 




gb:DKFZp434A1523_r1 434 (synonym: ntes3) 


1.41 


2.75 


408522 


AI541214 


Hs.46320 


Small proline-rich protein SPRK [human, 


1.39


1 "JQ 
I.JO 


412632 


AL120379


Hs.74294 


aldehyde dehydrogenase 7 family, member 


1.81 


2.51 


423291 


NM_004129


Hs.126590


guanylate cyclase 1, soluble, beta 2 


1.54 


2.03


456172 


R99050 




gb:yq65c02.r1 Soares fetal liver spleen


1.46 


2.68 


452123 


AI267615 


Hs.38022 


ESTs 


1.24 


1.93 


433900 


AA721668 


Hs.257761 


ESTs 


1.78 


2.66 


408436 


R31954 


Hs.7885 


phosphatidylinositol binding clathrin as


1.21 


o oc 
235 


417123 


BE326521 


Hs.159450


ESTs 


1.46 


2.09 


436023 


T81819 


Hs.302251 


ESTs 


O CD 

I. bo 


O 7ft 


454150 


AA131893 


Hs.154088


hypothetical protein FLJ22756


1.40 


2.50 


444094 


AI695764 


Hs.202394


ESTs 


1.28 


A ftO 

4.U3 


429176 


AW975021 


Hs.193800


ESTs 


1.08 


O CO 


422259 


AA307584 




gb:EST178498 Colon carcinoma (HCC) cell


1.80 


3.03 


451164 


AA015912 


Hs.60659 


ESTs, Weakly similar to T46471 hypotheti


1.47 


1.17


417501 


AL041219 


Hs.82222 


sema domain, immunoglobulin domain (Ig),


2.92 


2.70 




DuUUJjUJ 




nh , OV^RTn9flR-14n9nfLnn , i.h(11 RTOMfi Hnmn 
yu. \J v o-o 1 Utoo- \ HUtWMJO*J-iiu i o i utsu nun nj 


1.70 


2.70 


443387 


BE139135 


Hs.254629 


ESTs 


1.48


2.32 


420995 


AA282495 


Hs.89014 


ESTs 


1.45 


1.51 


407329 


AA576061 


Hs.269834 


ESTs, Weakly similar to ALUD_HUMAN !!!!


1.13 


2.38 


438797 


C16161 


Hs.283040 


hypothetical protein PRO2543


0.99 


2.75


443357 


AW016773 




low molecular mass ubiquinone-binding pr


1.60 


2.08 


412656 


AF006011 


Hs.74375 


dishevelled 1 (homologous to Drosophila 


1.32 


1.13 


427377 


AU077029 


Hs.177543


antigen identified by monoclonal antibod


1.24 


0.79 


412200 


R08110 


Hs.187462


ESTs, Weakly similar to I38022 hypotheti


1.35 


1.54 


432586 


AA568548 




ESTs 


1.50 


2.25 


411590 


T96183 




gb:ye09f07.s1 Stratagene lung (937210) H 


1.22 


2.53


422672 


X12784 


Hs.119129


collagen, type IV, alpha 1 


2.27 


2.20


420256 


U84722 


Hs.76206 


cadherin 5, type 2, VE-cadherin (vascula 


1.59 


2.11


419900


AI459960 


Hs.170898


ESTs 


1.30 


2.68 


410805 


AW804742 


Hs.84264 


acidic protein rich in leucines 


1.16 


2.28 


452560 


BE077084 


Hs.99969 


ESTs 


1.44 


2.58 


448429 


D17408


Hs.21223 


calponin 1, basic, smooth muscle


4.02 


1.00 


424436 


AW818428 


Hs.4953 


golgi autoantigen, golgin subfamily a, 3


1.10 


2.00 


447400 


AK000322


Hs.18457 


hypothetical protein FLJ20315


1.45 


1.28 


422522 


AI023428 


Hs.34549 


ESTs, Highly similar to S94541 1 clone 4 


1.82 


2.10 


443696 


AW607444 


Hs.134622 


ESTs 


1.98 


2.01 


436094 


AI798701


Hs.222222 


ESTs 


1.34 


ZAO 


420168 


AF217508 


Hs.95594


serine carboxypeptidase vitellogenic-lik


1.58 


2.45


430325 


AF004562 


Hs.239356 


syntaxin binding protein 1 


1.34 


2.43 


439022 


AA356599 


Hs.173904 


ESTs 


2.76 


2.40 


420563 


AA278327 


Hs.136237


ESTs, Moderately similar to Y140_HUMAN H


1.78 


2.65 


429494 


AA769365 


Hs.126058 


ESTs 


1.50 


2.40 


420689 


H79979 


Hs.88678 


ESTs 


1.26 


2.28 


448988


Y09763


Hs.22785


gamma-aminobutyric acid (GABA) A recepto


1.72


1.29


439943


AW083789


Hs.124620 


ESTs 


1.45 


2.84


442300 


AI765908 


Hs.129166 


ESTs 


1.24 


135 


449614 


AI989490


Hs.197703 


ESTs 


1.12 


172 


444363 


AI142827 


Hs.143656 


ESTs 


1.32 


2.08 


424479 


AF064238 


Hs.149098 


smoothelin


1.59 


1.10 


437321 


AA768966 


Hs.292026 


ESTs, Weakly similar to 2109260A B cell


1.28 


2.07 


431926 


AW972724 




gb:EST384816 MAGE resequences, MAGL Homo


1.52 


2.63 


433640 


AW390125 


Hs.240443 


Homo sapiens cDNA: FLJ23538 fis, clone L


1.69 


1.30


415901 


H08396 


Hs.76118 


ubiquitin carboxyl-terminal esterase L1


1.71 


2.19 


437199 


AL110175


Hs.306337 


Homo sapiens mRNA; cDNA DKFZp564H0616 (f


1.48 


2.50 


457450 


AW294163 


Hs.146127 


ESTs 


1.07 


2.60 


456678 


AF141305 


Hs.173736 


ancient ubiquitous protein 1 


1.44 


135 


451079 


AI827988 


Hs.240728 


ESTs, Moderately similar to PC4259 ferri


0.95 


3.00 


405944 






Target Exon 


1.48 


145 


408877 


AA479033 


Hs.130315 


ESTs, Weakly similar to A47582 B-cell gr 


1.38 


120 


446682 


AW205632 


Hs.211198 


ESTs 


1.38 


3.00 


431380 


AW610282 


Hs.291003 


ESTs 


1.43 


164 


442027 


AI652926 


Hs.128395 


ESTs 


1.18 


143 


423578


AW960454 


Hs.222830 


ESTs 


1.56 


2.18 


441495 


AW294603 


Hs.127039


ESTs 


2.80


1.73 


417900 


BE250127 


Hs.82906 


CDC20 (cell division cycle 20, S. cerevi


1.36 


1.18 


443949 


AW827419 


Hs.235070 


ESTs 


1.30 


128 


440495 


AA887212 


Hs.14161 


hypothetical protein DKFZp434H930 


1.74 


2.78 


449948 


R19156 


Hs.20798 


ESTs 


1.12 


2.23 


439564 


W77911 


Hs.110006 


ESTs 


1.34 


2.85 


423225 


AA852604 


Hs.125359


Thy-1 cell surface antigen 


1.24 


1.09 


436139 


AA755786 


Hs.120936 


ESTs 


1.30 


110 


456968 


AI174861 


Hs.190623 


ESTs 


1.14 


115 


437191 


NM_006846


Hs.331555 


serine protease inhibitor, Kazal type, 5


\2\ 


118 


411652 


AW855393 




gb:CM3-CT0275-191099-024-f10 CT0275 Homo


1.85 


1.94 


420732 


M789133 


Hs.88650 


ESTs 


1.66 


2.71 


409291 


AW373472 




gb:RC3-BT0523-181299-011-d12 BT0523 Homo


1.56 


130 


424415 


NM_001975


Hs.146580 


enolase 2 (gamma, neuronal)


1.56 


1.39 


424927 


AW973666 


Hs.153850


hypothetical protein C321D2.4 


1.58 


1.24 


450946 


AA374569 


Hs.127698


ESTs, Moderately similar to 2109260A B c 


1.02 


125 


428423 


AU076517 


Hs.184276 


solute carrier family 9 (sodium/hydrogen 


1.70 


2.39 


415361 


F06724 




gb:HSC1IG021 normalized infant brain cDN 


1.34 


2.40 


406490 






C5001926:gi|7511572|pir||T42245 probable


1.28 


140 


410855 


X97795 


Hs.66718 


RAD54 (S.cerevisiae)-like


1.52 


100 


440010 


AA534930 


Hs.127236 


hypothetical protein FLJ12879


1.12 


120 


429508 


AW369620 


Hs.33944 


ESTs, Weakly similar to ALU1_HUMAN ALU S


1.33


128 


426340 


Z97989 


Hs.169370 


FYN oncogene related to SRC; FGR, YES 


1.88 


118 


416889 


AW250318 


Hs.80395 


mal, T-cell differentiation protein


1.65 


1.16 


451870 


AI820991 


Hs.8377 


ESTs 


1.24 


2.03 


444091 


AV647924 


Hs.282376 


ESTs 


1.05 


2.13 


410793 


AW581906 


Hs.66392 


intersectin 1 (SH3 domain protein) 


2.00 


3.13 


452222 


AW806287 


Hs.21432 


SEX gene 


1.25 


1.10 


433010 


AW970018 




gb:EST382097 MAGE resequences, MAGK Homo


1.36 


141 


432674 


AA641092 


Hs.257339 


ESTs, Weakly similar to I38022 hypotheti


1.14 


103 


438855 


AW946276 


Hs.6441 


Homo sapiens mRNA; cDNA DKFZp586J021 (fr


2.24 


1.77 


448718 


AA220235 


Hs.153959 


hypothetical protein MGC15436 


1.52 


2.65 


402685 






Target Exon 


2.04 


2.46 


424528 


AW073971 


Hs.238954 


ESTs, Weakly similar to KIAA1204 protein


1.66 


2.05 


422068 


AI807519


Hs.104520 


Homo sapiens cDNA FLJ13694 fis, clone PL


1.69 


4.98 


451225 


AI433694


Hs.293608 


ESTs 


1.79 


170 


441078 


AI453268 


Hs.323409 


Homo sapiens cDNA FLJ14113 fis, clone MA


1.44 


158 


409406 


H83092 


Hs.49605 


ESTs 


1.38 


2.05 


422297 


AW961290 




p30 DBC protein 


1.20 


173 


408711 


AW376061 


Hs.63335 


ESTs, Moderately similar to A46010 X-lin


1.20 


108 


426696 


AW363332 


Hs.171844 


Homo sapiens cDNA: FLJ22296 fis, clone H


1.35 


168 


417324 


AW265494 




ESTs 


1.68 


1.25 






WO 03/003906 



PCT/US02/21338 






tUO£00 


RP141R7Q 

DC !*♦ lOf 3 




nh-nV7 rlTnnfl , 4Jl719qQJl1R_hri'i UTnnfi^ Hnmn 


1.25 


2.65 


415166 


MM P.nififi9 


na. I OUOO 


CaTuuxypcpuOaSe i 


1.34 


1.09 


dnfiinn 

SUOOUU 






Target Exon


1.61 


2.47 


A\ 1 QRfl 
H I lOOU 


AIA/R79A77 
AWuf Z4f f 




gD.nmjuiuj.xi iNUi^uvaAr^ iny*» nomo sapiens 


O.DU 


4.03 


A991Q7 


r IbJbO 


ns. iiw»d 


cyiocnroniB c oxiaase suDunii vua poiype 


0 1fi 
L. 10 


1 44 


499S.R7 


AP11117R 
AT II 1 I/O 


Ui 11Rin7 


glypican 6


1.57 


2.03 


HODO0D 


AA719R94 
AA/ 0£0£4 


rib. i qoodl 


CO IS 


1.08 


2.75 


MUOOOO 








0.93 


2.13 


447711 


AP1R74R9 
Mr 1 Of Hoc, 


ns. I3HUU 


MA 1*49 /m^tAlip arrpci Hofirionl upact h 

mtWJL \(f liLUuv dJI col UclILICIll, ycaol, [1 


1.18 


1.07 


4171 17 
*♦ J / 1 I / 


NIOI 10 






1.70 


2.85 


41 ifiQn 
«* 1 1 oau 


A A n RQ9nl 
AA003£00 




RMA 1 10 email nifHear 
IMNA, U£ wild] I nUWcof 


2.12 


2.78 


4419A1 
*WO£*tO 


AI4R9AQR 

AI40Z430 


ns. i Jiuoo 


F(5Te 

CO 1 9 


1 i<i 

1. 13 


2.83 


491A74 

4£OU/ 1 


Al 1 0QQR1 




P^U nrimarv roennncp fl RPR1 rati hnmnln 


1.37 


1.43 


A ACQ 4 C 


AW9QG.919 
A»f£30£0£ 


n5.*»t!7 


ATP ci/nthaco U Irancnnrfjnn m'titvhnrtHr 
rt i r &yfi« too a, n udflopUiuiiy, iIIKUCiiOnui 


1 R3 


2.23 


44Q7QQ 

•Ha/ ay 


Al1Al4fi.fi 
A! l**O**0O 


no. i^ouou 


F«5Te 

CO IS 


I.HU 


2.08 




TlfiQRA 




nh*MIR9fl7Q_(?R MnrmatiTpH Infant hrain R 


1.88 


1.85 




RPAAIiqc. 




PQTe WoaHu eimltar fn unlfnnu/n rvnloin 
CQIO| VVColUy OllliliCU VU uiiAJIUWII ptUlSJII 


1.57 


2,57 




AWQ7AQ4Q 
Avvy/ t i3*»y 


Ue 1RR^fi4 
ns, \ OCQD 4 * 


co la, vvedwy oinuidi iu luou^^ nypuutcu 


1 in 


2.30 


41 fill 1 
HIOOI 1 


uouc>£3 




nh-HIIUnfllHn^R Human fplal hrain fTPiiiiwn 
yu.nuiviuo i nuju nuinaii icidi uiaui ^irujiwd 


1.58 


4.35 


444bl4 


K444B4 


ns.z/ou 


hAlarnnanAAMp n 1 i/\]nnr riK/\nimlonnmlaln 

neierogeneous nuaear nDonucteuproiein 


1 fin 
1. 00 


9Q.fi 

£.90 


Acetone 


MM flflfifiQfi 
INIv^_UU0030 


ria.O 1 1 OL 


hicfamino M-mo(hwtfranefc>ra<P 
rUaldlJllllc IYn llclliyiUdJIolcidSc 


1.24 


2.08 


A1 acqi 

41UOOO 


a\a/77aioa 
AW/ /UZoU 




CCTc Unitaralaki eimllar ir*K91B /solan 

Co 1 s, Muucidieiy similar to jl>dzoo gaac 


1. 00 


4.33 


430410 


Ar0yyi44 


ns.JJ44bb 


tryptase beta 1


1 Q1 

i.y i 


1.00 




A A/1C4 OKK 




DADO It nmlain 

rvADy-u\u piuiein 


1 AO 


2.14 


432621 


AIZ98501 




CCTp \AfooHw ptmil-w In TjIAjIOQ Ki/nntKalS 

bo 1 s, weaxiy similar to 1 4b4zo nypomeu 


1 Afl 


1 OA 


441584 


AlAM A 0110 

AW1 48329 


HS.l/b^Uo 


COTp 

coTS 


1 11 
1.1a; 


1 An 


A/ covin 
44o;»4U 


ncn/tiQ 
UolKloO 


We 1477Q 


PCTe 
CO IS 


1.00 


9 7n 


400UZ4 


A Ami ACQ 

AAUJi4yy 


Ue 11R4RQ 

ns.i icwtjy 


CCTc 
CO IS 


9 n? 


1 7*» 

l.f 0 


444UU0 


. bho44000 


Uc 91RC70 


PQTe lAJoaWKj cimilar In QPPA Ul IMAM CPI \r 

co 1 s, weaKiy similar ro orrsH^nuMAn or 


1 <;A 


1 9Q 


44 £334 


AlfWft74ft 
AlUZb/lO 


Uc 1R(KA 


CCTc 
CO IS 


ifin 

J.OU 


17R 


4U£UO0 






O10UUV3U7 ,ylj^D<:r^ JO|yD|MrtoOOOOJ. I( (AT 


1 1R 


9 R1 


Any To 


A\AJQ7ft1K/l 




gu.rivw-r iuuiy-i*nzyy-uu3-ruo riuuiy norno 


inn 
z.uu 


9 Rl 
£.00 


4oobUo 


A\ASQ7C4CC. 

Awy/oibo 




go. tot oooc.iH mauc resequences, wiaoin nomo 


1 7R 




426106 


AI678765 


Up 04 (HO 


hols 


4 ^0 


9 11 

£.11 


JI1C1 11 

4431 01 


qcigiiia 


Ue QQ1CO 

tis.yy ibj 


CCTc 

cola 


«I.U** 


9 fin 
£.00 


44U0£0 


mm nnQOi? 

NM_UUOO 1 £ 


Uc 71RZ 

ns. / 1 04 


a □isimeynn anu meidiioproteii idbe uunid 


1.17 


2.55 


420447 


A AC071ftC 

AAbo/oUb 


Up QQjMQ 

Hs.oo44o 


COTp 
COlS 


1.00 


1 nfl 

£.00 


428055 


AA4/*U004 


Up 1(l17Rn 

ns. lUl/ou 


CCTc 

co IS 


1 OR 
I.UO 


9 

£. 10 


422110 


AIT7C70C 

A137b/Jb 


Up 4 A 4770 

ns. 1 11/ /b 


secreted protein, acidic, cysteine-rich 


4 7C 
1.10 


1 R9 
1.0Z 


438581 


AIA/0777CC 

AW97770D 


Up 1004 11 

ns.iyzijo 


CCTc k/tn^Ar^lnk> clmtl^r in 17RRQR conn 

co is, Mooeraiery similar 10 iroooo senn 


1 nn 

l.UO 


9 m 

£. IU 


403290 






P40/V\4n4 4*T l ;M7Rfl1111ro4'IKID nn AAM 11 A 

UlUUulUn .Ql|4/O0t l£|rel|Nr_UU4411.1j Q 


AQ7 


9 Afl 


408175 


lAnonQQ 

WzyOoy 


Up 4 once 


hwnrtlKn^pnl nmla'm nl^C7nRR7niA 1 R 

nypoifteiicai proiein ur\hz.pbo/u^4io 


1 AO 
1.4Z 




Aiiion 


AAQ1R4 77 
AM3O0 I / / 


Uc 97A4Rn 


nlfopfnn; raponlnr family R eiiKfamilu 

oiraciory recepiur, idrmiy o, buuidiimy 


1.26 


2.05 


>M1>M 4 

44o441 


Aw^ynyo 


Uc Q01QR 

ns.yziyo 


CCTc 
CO IS 


1 R9 


9 11 

£. 10 


/I 1 QOIG 

4i yyzo 


aai concn 


Uc Q17RR 

ns.yj/oo 


uporna nwoiLr lusiori panner 


1 79 


2.80 


445256 


AIQCBCIC 
AlODODJO 


Uc 1/JI7R1 
HS.144f OJ 


co IS 


i 07 


i n 

0.00 


456381 


A AIICCAft 

AAzJbbUb 




gD.zryyo lu.n iNW_^uAr_oupi norno sapiens 


1 1fi 
1.10 


1 Qn 

1. 30 


A11A11 
4££4JJ 


AAHflcfin 
AAJ IUOOU 


Uc 1R^74fi 


nypoincuCcU proiein ruj££*t9U 


1 nfi 

l.UO 


2.20 


AllftlQ 


Aiqoqcn7 
AJ3030U/ 


Uc IROO^C 


PCTe 
CO IS 


1.36 


2.25 


424951 


AW3b4UO£ 




nK-CCT17R1RR MACP racaflilonrat MAI^U Unmn 

QD.tb i j ( bioo wiAbc resequences, wiMvjn nomo 


9 99 
L.LC 


1 CD 

£.00 


420785 


HoSboo 


Ue 4Q4'3i4C 

HS.191o4b 


CCTp 

coTS 


4 1ft 


£. 10 


411347 


AIA/Q1Q41C 




nK-H\/1 1 TflflRI OATHrVLflQ? fM 1 TfifiRI Unmn 

gD.Uv£-L i uuo i-Z4uouu-uy » -iui liuudi nomo 


l.Oo 


9 1R 

£.00 


43B742 


AW£U414b 


Uc 4 0CCyl1 

ns.iybD4j 


CCTc 

colS 


1.1U 


9 in 

£.0U 


414900 


AW4o£4£i> 


Up 1>IBR7fi 


CCTp 

COlS 


1 A1 
Z.U1 


308 


443284 


AI1COQ4 1 

Al Jb9ol J 


Up C>I7Q1 


CCTp \AfnoMw pipiltm' fn TA07AC h«/nnlkali 

co i s, weawy similar 10 i hu uo nypouieu 


U.DO 


n4i 

U.40 


A AO n An 

402049 






Target Exon 


9 9fl 


2.00


429400 


AWbU494U 


Up 9A4CCO 

Hs./Uibbo 


transcription factor l\) (AKij 


4 4R 

1.1b 


i An 


423916 


Awyyo49D 


Up 4701C 

hs.l /^JO 


Homo ssptons clone i uuuiauui to itikna ssqu 


1 GO 

i.oy 


1 n^ 

l.UO 


432495 


A\AfQ71G17 


Up 4QC71^ 

HS.lobf J4 


to is, weaKiy sirraiar 10 10 1 fw pneromone 


1 CA 
l.OU 


9 n«; 

£.U0 


414840 


D07HO 


Up OIQOI 


nairy/ennancer-oi-spiu reiaiea wim t kk 


4 HQ 

1.0? 


9 no 

£.U? 


428711 


R46414 


Up CCQ1Q 

Hs.5bo2o 


trinucleotide repeat containing 5 


4 77 

1.77 


1 R1 
1.00 


448609 


AW1 39420 


Hs.7972 


1/IAAH074 mmImk 

KIAAOS/i proiein 


4 1>l 

1.14 


i it: 

£.£b 


443859 


mm /i no 
iMAyi^uiMuy 


Up OQ\A 

ns.yyi4 


lOiiisiaun 


1 17 

1.1 1 


1 On 
l.UO 


411141 


AIA/Q40CC1 

AWoiyobl 




-L.nrir CTrtOOl^ 4 JlAOAH A11_PAil CTAOOI Unmn 

gD;KUb-o i i4u*uu-uio-4jU4 0 1 u^yj Homo 


4 /A 

1.44 


9 An 

£.HU 


AAA116 
wui jo 


AI798851 


Hs.266959


hemoglobin, gamma G


1.18 


2.08 


417944 


AU077196 


Hs.82985 


collagen, type V, alpha 2 


2.10 


1.37 


429640 


U83508 


Hs.2463 


angiopoietin 1


1.92 


2.98 


410064 


X53416 


Hs.195464 


filamin A, alpha (actin-binding protein-


1.51 


1.29 


458218 


AI435179 


Hs.126820 


ESTs 


1.49 


1.15 


443114 


AI033377 


Hs.153298 


ESTs 


1.38 


2.05 


427788 


AA412397 


Hs.116858


ESTs 


1.45


1.85 


435913 


W95006 


Hs.269559 


ESTs, Weakly similar to S65657 alpha-1C-


1.63 


3.90 


457949 


W69171 


Hs.71741 


ESTs, Highly similar to I38945 melanoma 


1.01 


2.00


419203 


AA488719 


Hs.190151 


ESTs 


1.94 


2.45


412510 


AI056689 


Hs.133538 


ESTs, Weakly similar to ALU1_HUMAN ALU S


1.91 


2.20


413885 


BE177442 




gb:RC1-HTO595.20O40(W12-fO1 HT0595Homo 


1.48 


2.80


426239 


AA669615 


Hs.214226 


ESTs 


1.36 


2.50 









408886 


AW292096 


Hs.255036 


ESTs 


1.93 


2.92


412857 


AI7Q3484 


Hs.128052 


ESTs 


1.72


1.60 


427340 


BE167242 


Hs.47099 


hypothetical protein FLJ21212


1.46 


2.13


412902 


BE008018 




gb:QVO£N0147-290400-214-c01 BN0147Homo 


0.90 


2.05


451141 


AW772713 


Hs^47186 


ESTs 


2.38 


3.95 


412626 


AA114945


Hs.151839 


ESTs 


1.75 


2.15


405667 
417777 






Target Exon


2.62


3.79 


AI823763 


Hs.7055 


ESTs, Weakly similar to 178885 serine/th


1.24 


2.08


401400 






Target Exon 


1.16 


1.90 


426795 


S78234 


Hs.172405 


cell division cycle 27 


2.14 


1.63 


435046 


AA662772 


Hs.174330 


ESTs, Weakly similar to ALU1_HUMAN ALU S


1.14 


2.28


448401 


AI498509


Hs.346254 


ESTs 


2.50 


2.83


450832 


AW970602 


Hs.105421 


ESTs 


0.55 


0.39


441057 


AI043897


Hs.126483 


ESTs 


1.08 


2.13


438725 


AA815163 


Hs.127307


ESTs 


1.31 


2.65


450062 


AW001043 


Hs.200854 


ESTs 


1.30 


2.48 


441214 


AI820648 


Hs.129136 


ESTs 


1.43 


1.71 


431723 


AW058350 


Hs.16762 


Homo sapiens mRNA; cDNA DKFZp564B2062 (f


1.22 


2.30


414907 


X90725 


Hs.77597 


polo (Drosophila)-like kinase


1.35 


1.49 


423622 


BE154847 




gb:PM1-HT0345-121199401-d05 KT0345 Homo 


1.57 


2.30


450835 


BE262773 


Hs.25584 


hypothetical protein FLJ10767


1.40 


1.12 


444014 


AI095718 


Hs.135015 


ESTs 


2.30


1.78 


431603 


AA807955 


Hs.325984 


EST 


1.26 


2.03


408597 


AW419069 


Hs.209670 


ESTs 


1.35 


2.60


444312 


R44007 




ESTs 


1.95 


2.07


404286 






C6001909:gip04441|dbjlBAA18909.1| (0298 


2.29


2.22


438813 


M27346 




gb:Homo sapiens (clone HGP09/HGP32) T ce 


1.03 


2.43


445534 


AL038823 


Hs.12840 


Homo sapiens germline mRNA sequence 


1.00 


2.16


426046 


AA833655 


Hs.206868 


Homo sapiens cDNA FLJ14056 fis, clone HE


1.59 


2.73


451907 


AI822065 


Hs.50749 


ESTs, Moderately similar to ALU7_HUMAN A


1.74


2.65


418796 


AA226351 


Hs.34060 


ESTs 


1.28 


2.12


422431 


AJ769410 


Hs.221461


ESTs 


1.80 


3.58 


417557 


AA225622 


Hs.293589 


ESTs 


1.32 


2.14


455313 


AW894409 


Hs.125472 


ESTs, Moderately similar to KIAA0877 pro


1.48 


2.57 


415479 


F10042 


Hs.4840 


ESTs 


1.83 


2.01


450433 


AW444538 


Hs.231863 


ESTs 


1.31 


2.58 


410581 


AA018982 


Hs.125036 


tumor endothelial marker 7 precursor 


1.54 


1.62 


455407 


AW936813 




gb:PM2-DT0023-05040(W)03-b10 DT0023 Homo 


1.32 


2.15 


417552 


R00916 


Hs.166510 


ESTs 


1.50 


2.63


428290 


AI932995 


Hs.183475 


Homo sapiens clone 25061 mRNA sequence 


1.94 


2.70


432391 


AI732374


Hs.339827 


Human DNA sequence from clone RP5-881L22 


0.96 


2.38


456283 


U68162 


Hs.84171 


myeloproliferative leukemia virus oncoge 


1.22 


2.13 


438535 


L09078 




gb:Homo sapiens mRNA fragment


2.14


1.95 


416564 


AW795793 


Hs.2575 


Homo sapiens cDNA FLJ12257 fis, clone MA


2.28 


1.93 


435200 


AA670310 


Hs.145903 


ESTs 


1.16 


2.13


457635 


AV660976 


Hs.3569 


hypothetical protein 


1.37 


3.10 


444930 


BE185536 


Hs.301183 


molecule possessing ankyrin repeats indu 


0.99 


2.45


449319 


AA373630 


Hs.188750 


ESTs 


1.56 


3.28 


418992 


AW074143 


Hs.87134 


ESTs 


1.88 


2.20 


409357 


AW382767 




gb:PM04fr0339481199-001-h05 HT0339 Homo 


1.30 


2.50 


434973 


AW449285 


Hs.313636 


EST 


1.11 


2.65


408383 


BE466959 


Hs.144153 


ESTs 


1.30 


2.44


440100 


BE382685 


Hs.158549 


ESTs, Weakly similar to T2D3 HUMAN TRANS 


1.23 


2.71


431996 


AL122087


Hs.272304


Homo sapiens mRNA; cDNA DKFZp564C0371 (f 


1.24 


2.27 


427681 


AB018263 


Hs.180338 


tumor necrosis factor receptor superfami


1.70 


1.68 


405146 






C8001690*:gi|6754446lrefjNP_034760.1j ki 


2.00


0.68 


436154 


AA764950 


Hs.119898


ESTs 


1.43 


100 


451233 


AA047221 


Hs.59752 


ESTs 


1.38 


2.20


446856 


AI814373 


Hs.164175 


ESTs 


1.33 


3.93 


448211 


BE384592 


Hs.6451 


PRO0659 protein 


1.48 


2.73 


418283


S79895 


Hs.83942 


cathepsin K (pycnodysostosis) 


1.33 


2.68


409609 


AW444670 


Hs.335685 


ESTs 


1.27 


1.51 


450414 


AI907735 


Hs.21446 


KIAA1716 protein 


1.60 


1.24 


452929 


AW954938 


Hs.172816 


neuregulin 1 


2.01 


3.70 


435112 


AW976145 


Hs.143198 


inhibitor of growth family, member 3 


1.22 


1.30 


439806 


AA846824 


Hs.160903


ESTs 


0.80 


2.04


439910 


H66765 


Hs.339397 


ESTs 


1.28 


2.16


437886 


BE264111 


Hs.31314 


retinoblastoma-binding protein 7


1.06 


2.82


441354 


AA931221 


Hs.126813 


ESTs 


1.20 


2.28


428951 


AL138153 


Hs.300410 


ESTs, Moderately similar to A47582 B-cel


1.50 


1.83 


438272 


AI167963 


Hs.143700 


ESTs, Weakly similar to S65824 reverse 1 


1.34 


2.51


429642 


X68264 


Hs.211579 


melanoma cell adhesion molecule 


1.18 


1.18 


422121 


AI767949 


Hs.179833 


ESTs 


1.18 


2.26


411184 


AW821117 




gb:PM2-ST0303-17010(W)0^g03 ST0303 Homo 


1.18 


2.21


435871 


AF257077 


Hs.283627 


eukaryotic translation initiation factor 


1.17 


1.57 


430570 


AI417881 


Hs.292464 


ESTs 


1.49 


3.17 


431995 


AL080197


Hs.272302 


hypothetical protein 


1.52 


2.11
451326


AW296946 


Hs.256078 


ESTs 


1.19 


2.18 


437046 


BE149154 




gb:RC2-HT0252-271099-Ol7-c11 HT0252Homo 


1.18 


2.25 


410154 


F06959 




gb:HSC1QD01 1 normalized infant brain cDN 


1.41 


2.05 


434373 


AI565566


Hs.168587 


ESTs 


1.39 


1.33 


444552 


AW295211 


Hs.230777 


ESTs 


1.36 


2.20 


411608 


AW853441 




gb:RC1*CT0252-030100^)23-g09 CT0252 Homo 


2.12 


1.80 


440573 


BE550891 


Hs.270624 


ESTs 


2.19 


2.17 


443047 


AW157377


Hs.132910 


ESTs 


1.81 


2.28 


451473 


AW298047 


Hs.346198 


ESTs 


1.18 


2.30 


416265 


AA177088 


Hs.190065 


ESTs 


2.37 


3.38 


435375 


AI733610 


Hs.187832


ESTs 


1.12 


2.18 


401469 






NM_022137*:Homo sapiens secreted modular 


1.32 


1.61 


456152 


AA174126 


Hs.332163 


ESTs 


1.26 


2.50 


415808 


R21439 


Hs.334578 


Homo sapiens, clone IMAGE:3929520, mRNA 


1.39 


2.43 


452721 


AJ269529 


Hs.301871 


solute carrier family 37 (glycerol-3-pho 


0.92 


2.20


435127 


AI217926


Hs.179863 


EST 


1.36 


2.65 


420772 


AW752656 


Hs.222707 


KIAA1718 protein 


1.19 


1.50 


456332 


AA228357 




gb:nc39d05.r1 NCI_CGAP_Pr2 Homo sapiens 


1.45 


3.57 


444678 


AI741513 


Hs.143739 


ESTs 


1.43 


1.62 


446175 


AL036568 


Hs.291 


glutamyl aminopeptidase (aminopeptidase 


1.00 


1.53 


416463 


H59241 




Homo sapiens cDNA FLJ11095 fis, clone PL


1.44 


2.13


405158 






ENSP00000243337:CDNA FLJ13984 fis, clone


1.38 


2.68 


403903 






C5001 632*:gi| 1 0645308|gb|AAG21 430.1 JACOO 


1.32 


1.43 


407271 


X98937 




gb:H.sapiens rearranged Ig heavy chain { 


1.40 


2.68 


413929 


BE501689 


Hs.75617 


collagen, type IV, alpha 2 


1.59 


1.33 


450778 


U81375 


Hs.25450 


solute carrier family 29 (nucleoside tra 


1.17 


1.10 


434274 


AA628539 


Hs.116252 


ESTs, Moderately similar to ALU1_HUMAN A 


1.92 


2.80 


400075 






Eos Control 


1.76 


2.60 


433694 


AI2G861 1 


Hs.12066 


Homo sapiens cDNA FLJ11720 fis, clone HE


1.48 


2.33 


454826 


AW833676 




gb:QV4-TT0O0B-1 81 199-038-h04 n0008 Homo 


1.57 


2.89


415168 


AA160805 


Hs.199832 


ESTs, Weakly similar to 178885 serine/th 


2.08 


1.76 


439486 


AF086303 


Hs.103185 


ESTs 


1.49 


2.19 


403291 






Target Exon 


1.36 


2.28 


438618 


AA897673 


Hs.123457


ESTs 


0.75 


0.79 


455087 


AW855389 




gb:CM3-CT0275-191099-024-e06 CT0275 Homo 


0.91 


2.63 


408075 


AA382881 


Hs.42409 


CGI-146 protein 


1.46 


2.15


436826 


AA731863 


Hs.120276 


ESTs 


1.04 


3.11 


408961 


AW297475 


Hs.323180 


ESTs 


1.25 


1.39 


424408 


AI754813 


Hs.146428 


collagen, type V, alpha 1 


1.64 


2.05 


423300 


AK000742 


Hs.126774


L2DTL protein 


1.47 


3.44 


403217 


AL134878 




ribosomal protein, large P2


1.70 


2.22 


437990 


AI685579 


Hs.121784 


ESTs 


2.14 


1.69 


419156 


AC002366 


Hs.46329 


amelogenin (X chromosome, amelogenesis i


1.40 


1.45 


411817 


BE302900 


Hs.72241 


mitogen-activated protein kinase kinase 


1.18 


1.12 


425701 


AA361850 


Hs.322149 


Human clone 137308 mRNA, partial cds 


1.60 


2.15


418757 


AI864193 


Hs.169728 


hypothetical protein FLJ13150


1.57 


2.23


415184 


AA380436 


Hs.211973 


homolog of Yeast RRP4 (ribosomal RNA pro


1.32


2.09


414918 


AI219207 


Hs.72222 


hypothetical protein FLJ13459


1.61 


1.50 


401723 






Target Exon 


1.01 


2.68


439010 


AW170332 


Hs.75216 


Homo sapiens cDNA FLJ13713 fis, clone PL


1.28 


1.65 


449166 


BE168981 


Hs.23131 


kinesin family member C3


1.64 


2.58 


410642 


AW792784 




gb:CMO-UM0001-010300-258-M1 UM0001 Homo 


1.54 


1.90 


409556 


D38616 


Hs.54941 


phosphorylase kinase, alpha 2 (liver) 


1.63 


1.21 


439894 


AA853077 


Hs.300697 


immunoglobulin heavy constant gamma 3 (G 


0.76 


0.61 


401913 






ENSP00000249158*:CDNA 


0.97 


2.59 


406097 






Target Exon 


1.11 


1.23 


414745 


AA160511 


Hs.5326 


amino acid system N transporter 2; porcu 


1.29 


1.12 


445752 


AI733942 


Hs.344887 


ESTs 


2.03 


1.68 


408052 


AW501117 


Hs.283585 


ESTs 


1.32 


1.72 


407256 


AA204763 


Hs.288036 


tRNA isopentenylpyrophosphate transferas


1.01 


2.09 


423264 


AJ133439 


Hs.126076


Glutamate receptor interacting protein 


1.51 


2.39 


418859 


AA229558 




gb:nc15d10.s1 NCI_CGAP_Pr1 Homo sapiens


1.40 


2.35 


410370 


AB037753 


Hs.62767 


KIAA1332 protein 


1.34 


2.00 


417264 


AA195100 


Hs.188695 


ESTs 


1.09


2.61 


444909 




HS.1922BU 


to IS 


1 

1.90 


CMC 


419386 


AA236867 




ESTs, Weakly similar to I38022 hypotheti


1.14 


2.28


439212 


AF087995 


Hs.134877 


ESTs 


1.06 


2.90


437766 


W69171 


Hs.71741 


ESTs, Highly similar to I38945 melanoma


1.25 


2.28


448951 


AI611221


Hs.334802 


hypothetical protein FLJ10101


1.88 


2.17


401659 






Target Exon 


1.63 


2.05 


419145 


N99638 




gb:za39g1 1.r1 Soares fetal liver spleen 


2.63 


3.85 


444*13 


AW054834 


Hs.210356 


ESTs 


1.73 


2.14


433902 


AW292820 


Hs.144906 


ESTs 


1.88 


2.46


403072 






NM_003319*:Homo sapiens titin (TTN), mRN


1.32 


2.91


452484 


AB033042 


Hs.29679 


cofactor required for Sp1 transcriptiona


0.72 


0.70 


456788 


AA724612 


Hs.133130 


Homo sapiens mRNA; cDNA DKFZp566H0124 (f


1.90 


3.40 


403315 
1.22


2.00
408432


AJicojl lb 




/C770C 


AAOO<3)70 


Hs.160884




AA5oUoo5 


Hs.326082 




AW016722 


Hs.194976


4UN/J 






4443 lb 


Z48533 


Hs.283742 


*W0U« 


H61502 


LI. 4 fttl r 

Hs.10235




Y16645 


Hs.271387 


hOUO I f 


N71597 


Hs.29698 


4/14497 






<KJUOOO 


AW970093 


(J- oiirn 

Hs.24453. 


WO/ 






ATRROR 

4oocyo 


AW977385 


Hs.278615 


Midi A 

4iisr4 


AWB80414 


Hs.84264 


41^0^0 


A 14 1)4170 

AJ1 23478 


Hs.32112 




AW295364 


Hs.255418 


4Z4S3] 


AA775471 


Hs.241457


44J1UU 


A1Q33168 




445332 


AI220225


Hs.321057 


414/01 


D50917 


Hs.772}3 


421893 


NM_001078


Hs.109225 


424265 


AF173901 


Hs.144287


446667 


BE161878 


Hs.224805 


426399 


AA652588 


Hs.301348 


438190 


AA780020 


Hs.136798


406972 


M32053 




41/uoo 


a a <flj Ate 

AA194446 




44041 U 


a lie i -inn 
AI361109 


Hs.151721


4^/0/4 


NM_0D352o 


Hs.2178 


422526 


AA311763 


Hs.131056


439317 


AF086127 


Hs.50600 


*»U91ZD 


A a nc O 1 Of* 

AA063426 




412093 


BE242691


Hs.14947


AOA1CO 


AA256126 


Hs.16179 


426096 


D87436 


Hs.166318


mjcoo i 






405760 






402901 






453982 


AW014252 


Hs.252837 


424244 


AV647184 


Hs.143601 


439984 


BE559514 


Hs.275425 


457297 


AW968188 




415054 


AI733907 




426273 


AI174861 


Hs.190623 


405187 






413939 


AL047051 


Hs.199961 


427596 


AA449506 


Hs.270143 


408049 


AW076098 


Hs.345588 



406002 
408284 
431377 
451456 
427530 
431957 
422283 
419600 
423314 
451690 
454662 
454413 
416861 
415908 
438942 
407618 
429177 
448357 
422008 
434461 
413489 
405551 
441183 
456034 
420611 
422061 
437908 
423052 
401927 



AW248254 

AW178807 

AW386183 

AA405093 

AK002104 

AW411307 

AA448958 

AI400661 

AW451469 

AW812715 

AI653672 

AW977206 

H08623 

AW875398 

AW054922 

AA447527 

N20169 

AJ000534 

AA744046 

BE144228 

BE562910 

AW450979 

AA994635 

BE178434 

AI082424 

M28214 



Hs.44101 

Hs.246182 

Hs.210305 

Hs.126519 

Hs.272246 

Hs.114311 

Hs.91481 

Hs.127811 

Hs.209990 

Hs.40092 

Hs.151858 

Hs.22833 

Hs.6451 

Hs.53478 

Hs.207429 

Hs.108923 

Hs.110708

Hs.133350



Hs.92679 

Hs.129929 
Hs.267995 

Hs.123072 



CD1E antigen, e polypeptide

ESTs 

ESTs 

SH2 domain-containing phosphatase anchor 
Target Exon 

H.sapiens mRNA for retrotransposon
chromosome 5 open reading frame 4
small inducible cytokine subfamily A (Cy
ESTs, Weakly similar to ZN91_HUMAN ZINC
C8000068*:gij5453579|refjNP 006120.1 1 bo 
ESTs 

Target Exon 
ESTs 

acidic protein rich in leucines 

ESTs 

ESTs 

ESTs 

gb:ow94e08.s1 Soares_fetal_liver_spleen_
ESTs 

KIAA0127 gene product
vascular cell adhesion molecule 1
hairy/enhancer-of-split related with YRP
ESTs 

Homo sapiens cDNA FLJ13271 fis, clone OV
ESTs, Moderately similar to KBF3_HUMAN N
gb:Human H19 RNA gene, complete cds.
ESTs, Weakly similar to S55024 nebulin,
ESTs, Weakly similar to I38022 hypotheti
H2B histone family, member Q 
ESTs 

ESTs, Weakly similar to T47156 hypotheti
gb:zf7Qc0&s1 Soaresj>ineal_gland_N3HPG 
ESTs 

hypothetical protein FLJ23467
lipin 2 

NM_005012*:Homo sapiens receptor tyrosin
Target Exon 

NM_025206*:Homo sapiens hypothetical pro
ESTs 

hypothetical protein hCLA-iso 
hypothetical protein 

gb:EST380383 MAGE resequences, MAGJ Homo 
gb:zo86h09.y5 Stratagene ovarian cancer 
ESTs 

NM_014272:Homo sapiens a disintegrin-lik

ESTs, Weakly similar to ALU7_HUMAN ALU S 

extracellular glycoprotein EMILIN-2 prec

desmoplakin (DPI, DPII)

Target Exon 

protein kinase PKNbeta

ESTs 

ESTs 

ESTs 

hypothetical protein FLJ11292
CDC45 (cell division cycle 45, S.cerevis 
NEU1 protein 

disintegrin metalloproteinase with throm 
ESTs 

gb:RC4-ST0185-271099411-g01 ST0185Homo 

PNAS-123 

ESTs 

ESTs 

PRO0659 protein 

Homo sapiens cDNA FLJ12366 fis, clone MA
ESTs 

RAB38, member RAS oncogene family 
sarcoglycan, epsilon
ESTs, Weakly similar to 178885 serine/th
gb:Mr^HT0165-14020(M094Q4 HT0165 Homo 
Target Exon 

Homo sapiens clone CDABP0014 mRNA sequen
gb:UKr«l3-ala-a-12-0-Ul.s1 NCI.CGAP Su 
ESTs 

ESTs, Moderately similar to G02654 ribos 
ESTs 

RAB3B, member RAS oncogene family 
a7000914*:gi]8394367|refINP - 058549.1| s 
2.12


3.30 


1.14 
2.04
0.87
1.38


2.07


2.00


2.25


0.80 
1.44
2.03


1.40 
2.15


1.64 


3.17 


1.00 
432967


AA572949 


Hs.207566 


ESTs 


1.52 


2.28 


439159 


AF087972 


Hs.120938


ESTs 


2.03 


2.08 


415357 


H22757 


Hs.13471


ESTs 


1.82 


2.07 


442327 


AA991745 


Hs.42522 


ESTs 


1.48 


2.88 


430186 


AB020696 


Hs.234791 


KIAA0889 protein


1.46 


2.23 


426971 


AI809984 


Hs.243209 


ESTs, Weakly similar to NPA1_HUMAN NEURO 


1.06 


2.13 


422687 


AW068823


Hs.119206


Insulin-like growth factor binding prote 


1.61 


1.37 


432954 


AI076345 


Hs.214199 


ESTs 


1.19 


2.84 


429040 


AL035542 


Hs.248169 


olfactory receptor, family 2, subfamily 


1.25 


1.44 


414169 


AA136169 


Hs.149335 


ESTs 


1.59 


2.51 


419882 


AA687313 


Hs.190043


ESTs 


1.20 


2.50 


426900 


AW163564 


Hs.142375 


ESTs 


1.87 


1.77 


418773 


T39748 


Hs.325474 


Target CAT 


1.35 


2.02 


439776 


AL360140 


Hs.176005


Homo sapiens mRNA full length insert cDN 


1.32 


2.10 


428712 


AW085131 


Hs.190452


KIAA0365 gene product 


1.41 


1.52 


408839 


AW277084




gb:xp61h09.x1 NCI_CGAP_Ov39 Homo sapiens


1.14 


2.03 


450492 


AW290961 


Hs.201815 


ESTs 


1.17 


2.21 


434654 


AI825942 


Hs.139366


Homo sapiens clone 15 polyadenylated HER 


1.62 


2.31 


457567 


AW939074 




gb:GV1-DT0069-010200-057-c12 DT0069 Homo 


1.80 


3.73 


452426 


AI904823 


Hs.31297 


duodenal cytochrome b 


2.15 


1.84 


418559 


AA225048 


Hs.104207


ESTs 


1.84 


2.33 


439099 


AB037800 


Hs.6462 


protein kinase C and casein kinase subst 


1.07 


1.15 


451984 


R60571 


Hs.27406 


Homo sapiens mRNA; cDNA DKFZp566F1946 (f 


1.18 


1.22 


420789 


AI670057 


Hs.199882 


ESTs 


2.24 


2.55 


456396 


AA236863 


Hs.188894 


ESTs, Weakly similar to APXL_HUMAN APICA


1.07 


1.14 


402948 






NM_025206:Homo sapiens hypothetical prot


2.41 


1.83 


426405 


AW296631 


Hs.283403 


ESTs 


1.28 


1.07 


439732 


AW629604 


Hs.167641


hypothetical protein from EUROIMAGE 1703 


0.85 


0.77 


416784 


AA334592 


Hs.79914 


lumican 


1.88 


1.27 


422531 


AW967280 


Hs.293894 


ESTs, Weakly similar to HERC2 [H.sapiens 


1.20 


1.25 


415608 


F12795 


Hs.12286 


ESTs, Moderately similar to ALU1_HUMAN A 


1.03 


2.31 


428671 


BE297851


Hs.189482 


zinc finger protein 179 


1.26 


2.20 


420007 


H13700 


Hs.31235 


ESTs, Weakly similar to Y934_HUMAN HYPOT


1.60 


2.25 


400850 






Target Exon 


1.22 


1.03 


404580 






trichorhinophalangeal syndrome I gene (T 


1.00 


1.00 


407680 


AW064284 


Hs.279153 


ESTs 


1.02 


2.28 


410420 


AA224053 


Hs.172405 


cell division cycle 27 


1.64 


1.00 


421234 


AA907153 


Hs.190060


ESTs 


1.76 


1.45 


426791 


AA384910 


Hs.46519 


ESTs 


1.12 


2.15 


430439 


AL133561




DKFZP434B061 protein 


1.00 


1.00 


434036 


AI659131 


Hs.197733 


hypothetical protein MGC2849 


1.00 


1.00 


438915 


AA280174 


Hs.285681 


Williams-Beuren syndrome chromosome regi 


1.40 


1.00 


440304 


BE159984 


Hs.125395


ESTs 


1.00 


1.00 


441699 


AW511126 


Hs.127572 


ESTs 


1.00 


1.73 


443383 


AI792453


Hs.166507 


ESTs 


2.04 


1.00 


445660 


AI702668 


Hs.201955 


ESTs 


1.00 


1.00 


453160 


AI263307 


Hs.239884 


H2B histone family, member L 


1.00 


1.00 


456513 


AA279143 


Hs.88561 


ESTs 


1.00 


1.40 


457231 


AI472022


Hs.301959 


proline synthetase co-transcribed (bacte 


0.98 


0.63 


459565 


W27086 


Hs.209694 


ESTs 


1.00 


1.00 


429317 


AA831552 


Hs.268016 


Homo sapiens cDNA: FLJ21243 fis, clone C


1.00 


1.00 


430971 


M26150 


Hs.248177 


H3 histone family, member L 


1.14 


1.40 


408376 


AW971303 


Hs.292601 


ESTs 


1.08 
432250


AA452088


Hs.274170 


Opa-interacting protein 2

Homo sapiens cDNA FLJ20149 fis. clone CO 


1.26 


2.71 


431911 


AK000156 


Hs.272193 


1.46 


2.60 


413923 


AI733852


Hs.199957


ESTs 


1.62 


2.10 


449590 


AA694070 


Hs.268835 


ESTs 


1.20 


2.53 


438467 


AA808027 


Hs.123277


ESTs 


1.48 


2.10 


432121 


AI824879


Hs.211286 


ESTs, Weakly similar to 1207289A reverse 


1.27 


3.13 


412298 


AW936300 




gb:QV4-DT0021-281299-070-a04DT0021 Homo 


1.42 


2.60 


408519 


AA679082 


Hs.43481 


hypothetical protein DKFZp564K192 


1.84 


3.70 


416067 


T79732 


Hs.14633


ESTs 


1.11 


3.08 


420497 


AW206285 


Hs.253548 


ESTs


1.90 


2.48 


405704 






NM_001844*:Homo sapiens collagen, type I


1.42 


2.90 


423443 


AI432601 


Hs.168812 


Homo sapiens cDNA FLJ14132 fis, clone MA


1.42 


2.03 


415904 


Z44679 


Hs.336391 


ESTs 


1.62 


2.94 


413786 


AW613780 


Hs.13500 


ESTs 


0.33 


0.17 


404031 






C5001 700*:gi|9256616|ref|NP_061761. 1| pr 


1.94 


2.29 


457412 


N40711 


Hs.333300 


hypothetical protein FLJ14026


1.92 


3.20 


439719 


AF086554 


Hs.326048 


Homo sapiens mRNA; cDNA DKFZp434M0420 (f 


1.62 


2.30 


418161 


AI950754 


Hs.81716 


ESTs 


1.81 


2.42 


425894 


AW954011 


Hs.180711 


ESTs 


0.92 


2.20 


419988 


W39388 


Hs.55336 


Homo sapiens, clone MGC:17421, mRNA, com 


1.34 


2.57 


439668 


AI091277 


Hs.302634 


frizzled (Drosophila) homolog 8 


1.67 


2.66 


450177 


AI698091 


Hs.107845 


ESTs 


1.50 


2.25


459704 


AA719572 


Hs.274441 


Homo sapiens mRNA; cDNA DKFZp434N011 (fr 


1.27 


3.35 


410357 


AW663614 




gb:hj22e04.x1 NCLCGAPJJ8 Homo sapiens 


0.69 


0.59 


459234 


AI940425 




gb:CM0-CT0052-150799-024-c04 CT0052 Homo 


1.67 


2.08 


421313 


NM_014923 


Hs.103329 


KIAA0970 protein 


0.57 


0.26 


431322 


AW970622 




gb:EST382704 MAGE resequences, MAGK Homo 


1.80 


2.73


423086 


AB028984 


Hs.123420 


KIAA1061 protein 


0.40 


0.56 


425980 


AA366951 




gb:EST77953 Pancreas tumor 111 Homo sapi 


1.33 


2.50


423185 


BE299590 


Hs.125078 


ornithine decarboxylase antizyme 1 


0.60 


0.56 


410840 


AW806924 




gb:QV4-ST0023-160400-172-h10 ST0023 Homo 


1.50 


2.88


403917 






Target Exon 


1.82


2.02


437384 


AI674710 


Hs.174397 


ESTs 


1.26 


2.05 


444389 


AW439340 


Hs.189720 


ESTs 


1.26 


2.13


443318 


AI051603


Hs.133141


ESTs 


1.46 


2.20


441093 


AI698138 


Hs.126918 


ESTs 


1.40 


2.35


439432 


AI954203


Hs.57874 


ESTs 


0.88 


2.18


454629 


AW811114 




gb:MR2-ST0131-11 1199-016-a04 ST0131 Homo 


1.96 


2.31


406207 






Target Exon 


2.77


2.55


444872 


AI936264 




p30 DBC protein 


1.48 


2.45


401908 






C17000154:gi|12003980lgb|AAG43830.1|AF21 


1.15 


2.28


404730 






Target Exon 


1.84 


2.78


457498 


AI732230 


Hs.191737 


ESTs 


1.49 


2.55


448471 


AA158617 


Hs.21276 


collagen, type IV, alpha 3 (Goodpasture 


0.37 


0.36 


438978 


AI095207 


Hs.307972 


ESTs 


1.57 


2.39


418786 


AI796317 


Hs.203594 


Homo sapiens uncharacterized gastric pro 


2.86


3.34 


400416 


AF083130 




Homo sapiens CATX-14 mRNA, partial cds 


2.03


1.55 


450446 


AI696334


Hs.14450 


ESTs 


1.32 


2.38


419791 


AI579909


Hs.105104 


ESTs 


0.41 


0.27 


449436 


AA860329 


Hs.279307 


hypothetical protein DKFZp434l2117 


2.01


1.50 


430808 


L08603 


Hs.247980 


melanocortin 4 receptor 


1.09 


2.18 


443116 


AI033397


Hs.132225 


ESTs 


1.30 


2.25


437923 


BE088433 


Hs.334696 


hypothetical protein KIAA1335 


1.40 


2.50 


403294 






Target Exon 


0.98 


2.18


436007 


AI247716 


Hs.232168 


ESTs 


1.38 


1.00 


430649 


AB040941 


Hs.247713 


KIAA1508 protein


1.52 


2.85


437271 


AL137445 


Hs.28846 


Homo sapiens mRNA; cDNA DKFZp5660134 (fr 


1.29 


2.59


444500 


AV651273 


Hs.282966 


ESTs, Moderately similar to 2109260A B c 


1.22 


2.05


447434 


R16890 


Hs.137135 


ESTs 


1.72 


2.85


400830 






NM_025006:Homo sapiens hypothetical prot 


2.04


2.68 


428114 


AI821548 


Hs.98363 


ESTs, Weakly similar to I38022 hypotheti 


1.09 


2.74


409688 


AI150485




gb:qf36a10.x1 Soares_testis_NHT Homo sap


1.67 


1.38 


440781 


BE561823 


Hs.281434 


Homo sapiens cDNA FLJ14028 fis, clone HE


1.28 


2.50


442662 


U78168 


Hs.8578 


Rap1 guanine-nucleotide-exchange factor


1.92 


2.28 


443078 


M78728 


Hs.132694


Homo sapiens cDNA: FLJ23149 fis, clone L


1.42 


o n*j 
cMq 


440179 


AI990151 


Hs.125904 


ESTs 


1.49 


2.63 


446780 


R31107 




gb:yh61g01.s1 Soares placenta Nb2HP Homo 


1.96 


2.78


444173 


AI126432


Hs.149493 


ESTs 


1.50 


2.10


417939 


R53863 


Hs.337512


ESTs, Weakly similar to ALUB_HUMAN !!!!


1.60 


2.03


428490 


BE301738 


Hs.49806 


ESTs, Weakly similar to A46010 X-linked 


0.47 


0.44 


443869 


AI141520 


Hs.151464 


ESTs, Weakly similar to ALUC_HUMAN !!!!


1.25 


2.68


426322 


J05068 


Hs.2012 


transcobalamin 1 (vitamin B12 binding pr 


2.12 


1.15 


411630 


U42349 


Hs.71119 


Putative prostate cancer tumor suppresso 


0.64 


0.48 


454701 


AW854930 




gb:PMO-CT0263-201099-003-f06 CT0263 Homo 


1.30 


2.33


439795 


N77294 


Hs.194294 


ESTs 


1.17 


2.33


425546 


BE409762 


Hs.26118 


hypothetical protein MGC13033


1.17 


2.85 


411245 


AW833441 




gb:QV4.n0008*271099^)20-g01 TT0008 Homo • 


1.90 


3.98 
443153
434420 
426126 
421926 
459563 
453006 
437223 
417015 
420223 
425303 
400375 
456169 
409707 
422241 
443152 
452714 
415110 
443251 
433441 
434612 
417807 
426902 
436026 
428878 
439749 
442435 
416527 
441808 
417054 
446636 
418442 
416640 
403146 
457397 
439189 
423969 
459683 
426826 
414462 
438027 
408623 
433765 
417132 
416815 
435186 
411107 
406930 
411026 
415766 
446018 
440125 
449832 
431899
431531 
441077 
426799 
419480 
455908 
403332 
455753 
404429 
438941 
428745 
411567 
458714 
426839 
444539 
407322 
453826 
435695 



AI371823

AA688278 

AL118747 

AA300591 

AI590487 

AJ362575 

C15105 

AA837098 

N27807 

AA354765 

NM_014115

Y07909 

M861773 

Y00062 

AI803470

AW770994 

H04043 

BE185436 

R37094 

R76513 

R17806 

AI125334 

AA731124 

AA436884 

AL389942 



AF283775 Hs.35380 x 001 protein 047 
AA362525 gb:EST72223 Namalwa B cells I Homo sapie 1.38

T51454 Hs.82845 Homo sapiens cDNA: FLJ21930 fis, clone H 0.34
AL137566 Hs.32405 Homo sapiens mRNA; cDNA DKFZp5BSG0321 {f 1.28 
AL117574 Homo sapiens mRNA; cDNA OKFZp434L2221 {f 2.04 

H18047 Hs.335821 ESTs 2.02 
Hs.151999 ESTs 1 68 

Hs.34079 ESTs 1 13 

Hs.194864 hypothetical protein FLJ22578 1.34
Hs.26691 ESTs 1.31 
gb:EST1 3437 Testis tumor Homo sapiens cD 1 .48 

Hs.49760 gb:tt77d04jc1 NCLCGAP.HSC3 Homo sapiens 1,74 
Hs.303171 ESTs 1 17 

Hs.330716 Homo sapiens cDNA FLJ14368 fis, clone HE 0.54
Hs.269933 ESTs t.04 
Hs.286 ribosomal protein L4 ^08 
gb:EST63098 Jurkat T-cells V Homo sapien 118
NM_014115*:Homo sapiens PRO01 13 protein 1.83 
Hs.79368 epithelial membrane protein 1 154
Hs.313501 ESTs 0.79 
Hs. 1 701 21 protein tyrosine phosphatase, receptor I 1 ^46 

Hs.204529 KIAA1 806 protein 1 07 

Hs.30340 hypothetical protein KIAA1 165 0.45 
gb:yj45c03.r1 Soares placenta Nb2HP Homo 1.62 
Hs.278839 ESTs 134 
HsA3742 ESTs U6 

Hs.301 1 83 molecule possessing ankyrin repeats Indu 0 83 
Hs.269452 gb.7gOgbO6.rl Soares infant brain 1NIB H 1.30 
Hs.97408 ESTs 1 94 

Hs.120931 ESTs 201 

Hs.48926 ESTs ^22 

Hs. 1 57752 Homo sapiens mRNA full length insert cDN 1 [32 

Hs.244760 ESTs, Highly similar to B34087 h/potheti 2.09 

Hs.11038 ESTs 166 

Hs.127887 ESTs. Moderately similar to 1609195B bio 1.22 

Hs.174151 aldehyde oxidase 1 2.57 

Hs.15767 citron (rho-interacting, serine/threonin 1 16

Hs. 186898 ESTs 1.39 

Hs.79404 neuron-specific protein rj.31 

Target Exon 1 49 

Hs.109154 ESTs ^33 

Hs. 144630 nuclear receptor subfamily 2, group F, m 1 ,76 

Hs.34959 hypothetical protein DKFZp566N034 1.18 

Hs.199460 gbrwc73TO2j<1 Na_CGAP_Pan1 Homo sapiens 1.74 

Hs.172654 guanine nucleotide binding protein beta 204

Hs.301064 arfeptin 1 n.40 

Hs.19131 transcription factor Dp-2 (E2F dimerizat 1.08 

Hs.254037 ESTs 1 64 

Hs.112668 ESTs ^52 

Hs.269053 ESTs ti64 

Hs.80120 UDP-N^ty|.alpha-D^alar^mine:polyp 028 

Hs. 145631 ESTs 174 

Hs.95870 PTD015 protein 0.49 

gb:Human olfactory receptor (OR17-219)g 2.21 

gb:RC3-ST0197.120200-015-b05 ST0197 Homo 1.64 

Hs.50628 adaptor-related protein complex 4, sigma 1 .64 

Hs.249727 gb:hh92e12y1 NCI_CGAP_GU1 Homo sapiens 1.56 

Hs.253888 ESTs 146 

Hs.60049 ESTs ^27 

Hs.187726 ESTs till 

Hs.62654 kringle-containing transmembrane protein iio6 

Hs.15312 ESTs " 1 12 

Hs.3031 54 popeye protein 3 0 61 

Hs.122546 hypothetical protein FLJ23017 1 88

gb:QVO-HT0367-150200-114-h04 HT0367 Homo 1.77 

Target Exon \ 46 

gb:PM1-BT0585-11020f>003^02 BT0585 Homo 1.40 

Target Exon 1 31 

Hs.31864 ESTs 1*34 

Hs.201634 ESTs 1*72 

gb:MR2-CT0222-2110994)02-h06 CT0222Homo 1.60 

Hs.344777 ESTs 0.93 

Hs.172689 interleukin 3 receptor, alpha (low affin 1 39

Hs.146907 ESTs, Weakly similar to 2004399A chromos 1.66 

Hs.324570 ESTs, Weakly similar to ALU3_HUMAN ALU S 1.30

gb:DKFZp547F1 52_r1 547 (synonym: hforl) 1 52 

Hs.257675 ESTs 1.24 
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T62507 
AW118601 
AF017060 
AC002563 
AI873471 
BE262478 

AW969025 
AI951185 
AI830571 
AI674906 
AK0018SO 
BE622743 
N93047 
AW811978 
AA909619 
N56605 
U41514 
AL119470 
AW958042
U04691 
AW813786 
H01613 
AW631111 
AW238410 
AA694264 
AA521381 
BE142052 
AI241273 
H14843 
BE536584 
BE156306 

BE075124 

AF075047 
AA433896 
AW851630 
R20916 
M74782 
A1955765 
AA171892 
AL138129 
AA694324 



0.41 
2.63 
0.28 
119 
1.79 
3.17 
245 
141 
Z38 
125 
140 
3.33 
124 
0.46 
118 
3.10 
185 
114 
108 
0.84 
106 
143 
0.34 
• 107 
105 
205 
2.88 
123 
120 
1.73 
117 
175 
3.13 
112 
158 
1.48 
123 
226 
0.26 
118 
2.26 
190 
200 
2.00 
1.60 
0.29 
2.40 
3.08 
102 
151 
0.16 
3.10 
0.24 
3.88 
1.03 
151 
2.48 
125 
2.33 
153 
100 
113 
0.51 
138 
255 
260 
243 
201 
221 
206 
270 
107 
171 
118 
165 
173 
100 
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402294 






Target Exon 


1.80 


3.08 


417759 


R13567 


Hs.12548 


ESTs 


1.0J 


2.58 


417527 


AA203524 




gb:zx56e10.r1 Soares_fetal_liver_spleen_


4 CO 

1.52 


2.02 


427526 


AA405062


Hs.345830 


gb:a)12eQ4.r1 Soares_testis_NHT Homo sap 


2.03 


1.90 


455300 


AW891707 




gb:CM3-NT0090440500-171-e02 NT0090 Homo 


1.12 


220 


448121 


AL045714 


Hs.128553


hypothetical protein DKFZp564F013 


0.93 


2.28 


415855 


AI921875 




gb:wpfl7e04.x1 NCLCGAP_Wd12 Homo sapien 


1.43 


2.08 


425702 


N59555 




gb:yv76f05.s1 Soares fetal liver spleen 


1.61 


2.80 


441056 


H37860 


Hs.125720


ESTs 


1.11 


2.10 


400311 


AF072164 


Hs.137570 


Homo sapiens HSFE-1 mRNA, partial cds


2.04 


2.95 


451478 


NM_012331 


Hs.26458


methionine sulfoxide reductase A 


0.50 


0.20 


425268 


AA354502 




gb:EST62799 Jurkat T-cells V Homo sapien 


0.99 


2.08 


456397 


W28339 


Hs.150580 


PTD010 protein 


1.11 


2.29 


405654 






C1 2001 521 ;gi|7513934lpirj|T31081 cca3 pr 


2.30 


1.00 


450151 


AI088196 


Hs.22968


Homo sapiens clone IMAGE:451939, mRNA se 


1.21 


2.60 


419851 


AA287987 


Hs.13477


ESTs, Weakly similar to 1207289A reverse 


1.26 


260 


406016 






Target Exon 


0.57 


0.48 


440903 


AI468079 


Hs.126623


ESTs 


2.02 


1.61 


445026 


W90337 


Hs.282966 


ESTs, Moderately similar to 2109260A B c 


1.56 


2.23 


414182 


AA136301 


Hs.344442 


KIAA1105 protein


1.32 


2.55 


457048 


AA400352 


Hs.112861


ESTs 


1.54 


2.05 


440542 


AA889143 


Hs.295655 


ESTs, Weakly similar to PC4259 ferritin 


1.48 


215 


422857 


R71461 




gb:yi51h07.r1 Soares placenta Nb2HP Homo 


1.42 


278 


445948 


AW444662 


Hs.202247 


ESTs 


1.50 


2.48 


454002 


BE299567 


Hs.271749 


ESTs, Moderately similar to ALU8_HUMAN A 


1.31 


2.25 


413656 


T91703 




gb:ye20g09.s1 Stratagene lung (937210) H


2.10 


1.69 


420441 


A3986160 


Hs.180383


dual specificity phosphatase 6 


0.99 


233 


412062 


H09124 


Hs.202341 


Homo sapiens cDNA: FLJ23573 fis, clone L


2.14 


1.61 


408991 


BE501816 


Hs.281927 


ESTs 


1.76 


283 


432534 


AW361626 


Hs.339833 


hypothetical protein FLJ11240 


0.41 


0.28 


435136 


R27299 


Hs.10172


ESTs 


0.76 


3.40 


451052 


AA281504 


Hs.24444 


Homo sapiens cDNA: FLJ22165 fis, clone H


2.16 


1.85 


413928 


AA442496 


Hs.6700 


ESTs, Moderately similar to Z195_HUMAN Z


1.30 


2.00 


439448 


AA970788


Hs.257586 


ESTs 


1.87 


2.23 


403344 






NM_000341:Homo sapiens solute carrier fa


1.36 


2.22 


418056 


AA524886 




gb:nh34f02.s1 NCI_CGAP_Pr3 Homo sapiens 


1.42 


2.85 


435428 


AI791746 


Hs.130293


ESTs 


2.44 


1.32 


419964 


AA811657 


Hs.220913 


ESTs 


1.32 


208 


440926 


AW196772 


Hs.131323 


ESTs 


1.80 


265 


452625 


AA724771 


Hs.61425 


ESTs 


1.64 


2.18 


452797 


AI369787 


Hs.7146 


ESTs 


1.47 


3.16 


436120 


AI248193 


Hs.119860


ESTs 


1.41 


2.83 


449567 


AI990790 


Hs.188614 


ESTs 


1.48 


245 


409628 


AB021865 


Hs.55276 


potassium voltage-gated channel, Shake 


1.70 


2.23


416617 


H69311 


Hs.205980 


ESTs 


1.83 


204 


452266 


AI767250 


Hs.165240 


ESTs 


0.58 


0.43 


404606 






Target Exon 


1.47 


3.75 


401814 






Target Exon 


2.00 


1.91 


428403 


AI393048 


Hs.326159 


leucine rich repeat (in FUI) interacts 


0.33


0.21 


433390 


AA586950


Hs.260180 


Homo sapiens mRNA: cDNA DKFZp761G18121 ( 


2.00 


4.90 


451443 


AW295527 


Hs.210303 


ESTs 


1.87 


225 


411188 


BE161168 




gD:rl^nlM^D-i7UiOU-UUZ-aiUnlu4iO homo 


0 1K 
£.10 


i .oy 


452704 


AA027823 


Hs.149424 


Homo sapiens PNAS-130 mRNA, complete cds 


2.64 


1.65 


424060 


X92108 




H.sapiens mRNA for subtelomeric repeat s 


240 


2.58 


433331 


AI738815


Hs.117323 


ESTs 


1.46 


2.10 


428520 


AA331901 


Hs.184736 


hypothetical protein FLJ10097


0.44 


0.19 


439492 


AF086310 


Hs.103159 


ESTs 


0.42 


0.26 


426736 


M431615 


Hs.130722 


ESTs 


1.90 


245 


416225 


AA577730 


Hs.188684 


ESTs, Weakly similar to PC4259 ferritin


272 


6.25 


404917 






Target Exon 


1.60 


215 


448955 


AW207597 


Hs.28102 


ESTs 


2.08 


1.75 


402797 






Target Exon 


2.12 


1.37 


457951 


U23860 




gb:Human clone mcag19 chromosome 16 CTG 


1.72 


200 


426982 


AA149707 


Hs.173091 


ubiquitin-like 3


0.36 


0.17 




Pkey: Unique Eos probeset identifier number 
CAT number: Gene cluster number
Accession: Genbank accession numbers 



Pkey CAT Number Accession 

408139 10421 1 AA451966 NM 01 6370 AB036693 AL1 39228 R581 24 AI634647 AL1 1 9333 W07356 A1334284 H29050 Al 192685 AA652438 AW1 72843 W19794 
N21460 AI743862 AW130622 AA991348 AI204553 AA9926S4 N80848 AA699329 AI824676 R26624 R49653 

408255 104935U AW807321 AW807262 AW177104 AW807319 AW8071 15 AW807344 AW807324 AW1781 16 BE141575 AW845849 AW8071O5 AW845868 
BE140942 AW807178 AW807167 AW807398 AW807320 AW807306 AWB45866 
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408839 
409126 
409282 
409291 
409367 
409688 
409692 
409894 
410154 
410357 
410642 
410725 
410730 
410744 



410956 
411008 
411026 
411141 
411184 
411188 
411245 
411347 
411432 

411496 
411567 
411590 
411608 
411652 
411880 
411918 
411920 



412085 
412252 
412274 
412298 
412302 
412679 
412902 

413035 
413068 
413196 
413346 
413444 
413489 
413593 
413645 
413656 
413750 
413885 
413888 
414051 
414623 
415054 
415110 
415157 
415361 
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BE011368 BE01 1 362 BE01 1 21 5 BE01 1 365 BE01 1 363 

456172 1603643 1 R99050 R99067 H67642 

456332 179104 1 AA228357AW841786 AW841716 

456381 184123 1 AA236606 AA459341 AA237079 

457297 313764 1 AW968188AA468196 AA468269 AA46829B 

457389 331080 1 AW970989AA502167 AA507546 t 

457567 357346*1 AW939074 AW939073 BE1 60476 AW939938 AW939206 AW940012 AW939076 AA573577 AW750479 AA574383 AW970057 

457581 359936*1 AA578512 AA595535 BE 177533 j ' * 

457741 395767*1 BE044740 AW827360 AW827623 BE161439 BE044718 BE046207 BE046551 AA653908 BE166581 

457871 426637*1 AI168278 AA868236 BE550792 AI522194 A1819707 AA973538 AI990086 AI628424 A1095270 AI991608 AA730741 

457951 44251 T U23860U80739 

459234 945240.-1 AI940425 

TABLE 8C

Pkey: Unique number corresponding to an Eos probeset
Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (GI) numbers. "Dunham I. et al." refers to the publication entitled "The DNA sequence of human chromosome 22." Dunham I. et al., Nature (1999) 402:489-495.
Strand: Indicates DNA strand from which exons were predicted.
Nt_position: Indicates nucleotide positions of predicted exons.
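
The Nt_position entries in the rows that follow are comma-separated start-end pairs, one pair per predicted exon. A minimal sketch of how such an entry could be read is given below; the function names `parse_nt_position` and `total_exon_length` are hypothetical, and treating the coordinates as inclusive is an assumption that the table legend does not state.

```python
def parse_nt_position(text):
    """Split an Nt_position entry such as '5005-5426,6810-7042'
    into a list of (start, end) integer pairs, one per predicted exon."""
    exons = []
    for part in text.split(","):
        start, end = part.strip().split("-")
        exons.append((int(start), int(end)))
    return exons


def total_exon_length(text):
    """Sum the exon lengths, assuming inclusive coordinates (an assumption)."""
    return sum(end - start + 1 for start, end in parse_nt_position(text))


if __name__ == "__main__":
    # Entry taken from the row for Pkey 401686 in this table.
    print(parse_nt_position("5005-5426,6810-7042"))
    print(total_exon_length("4506-4691"))
```

Entries whose separators were damaged in extraction (for example a period in place of a comma or hyphen) would need to be repaired before parsing.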



Pkey 


Ref 


Strand 


Nt_position


400704 


8118864 


Minus 


63110-63241 


400830 


8570385 


Plus 


157683-163035 


400834 


8705192 


Plus 


121963-122288 


400840 


9188586 


Plus 


113882-114121


400850 


1927150 


Minus 


4506-4691 


400881 


2842777 


Minus 


91446-91603,92123-92265


401016 


8117441 


Plus 


126234-126359,128050-128236


401090 


8492704 


Minus 


201281-201460 


401215 


9858408 


Plus 


103739-103919 


401241 


4827300 


Minus 


30503-30844,31056-31248


401335 


9884881 


Plus 


15736-16352 


401381 


8570226 


Minus 


118629-119146,119392-119657


401400 


7708226 


Minus 


33028-33585 


401469 


6682292 


Minus


125521-125639 


401473 


7249001


Plus 


115142-117305 


401577 


9280797 


Minus 


139377-139674,141195-141281,142217-142340 


401658 


9100664 


Plus 


89638-90028 


401659 


7689875 


Minus 


183379-183521 


401686 


6468551 


Plus 


5005-5426,6810-7042 


401723 


7656694 


Plus 


147273-147503 


401814 


7408052 


Plus 


136003-136726 


401818 


7467933 


Minus 


10964-11084,11674-11817 


401890 


8516144 


Plus 


148955-149396,149569-150002 


401908 


8698760 


Minus 


126888-127024 


401913 


9369520 


Minus 


33753-33904 


401927 


3873185 


Minus 


112000-112137 


402025 


7547159 


Plus 


173835-173998 


402039 


7770432 


Plus 


560-1294 


402049 


8072512 


Plus 


100065-100419 


402085 


7249154 


Plus 


90533-90687,94949-95158 


402241 


7690131 


Minus 


125073-125206,130996-131125 


402294 


2282012 


Minus 


2575-3000 


402305 


7328724 


Plus 


40832-41362 
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402366 9454515 Plus 

402551 9856793 Minus 

402654 8076879 Plus 

402685 8318556 Plus

402762 9230904 Minus

402797 3421043 Minus

402833 8918545 Plus

402901 8894222 Minus

402948 9368458 Minus

403066 8954202 Plus

403072 8954241 Plus

403146 9799812 Plus

403197 9930749 Plus

403214 7630945 Minus

403217 7630969 Plus

403290 8083176 Plus

403291 7230870 Plus
403294 8096496 Plus
403315 8247953 Minus 
403332 8568139 Minus 



403344 




Plus


403362 


OO / I / / L 


Plus


403371 




Plus


403391 




Plus 




ay bob 15 


Minus 


403536 


On7CQ*>^ 

ou/o9<i4 


Plus 


403779 


oUloU40 


Minus 




7708954 


Plus


40W71 


77/1QOCO 

7709262 


Plus 


lujyuj 


7710671 


Minus 


nuoy 1 I 


7710849 


Plus 


4fl?Q7fl 
'♦uoy/o 


O576014 


Plus


4nAfl11 


7671252 


Plus




9926594 


Minus 




6706820 


Plus




2326514 


Plus




7382420 


Minus 




7407959 


Plus 




7407979 


Plus 




7528051 


Plus




8151634 


Minus 


404580 


6539738 


Minus 


*HJ4oUo 


9212936 


Minus 


404730 




Plus 


404917 


7341851 


Plus


405033 


7107731 


Minus 


405137 


8570507 


Plus 


405146 


9438278 


Minus 


405158 


9966252 


Plus


405187 


7229826 


Plus 


405223 


7239614 


Plus


405340 


6094635 


Plus 


405494 


8050952 


Minus 


405551


1552506 


Plus


405654 


4895155 


Minus 


405657 


4726099 


Plus 


405673 


4589984 


Plus


405704 


4204244 


Plus 


405723 


9801668 


Plus


405727 


963833^ 


Minus 


405760 


6066938 


Minus 


405779 


7280331 


Minus 


405944 


7883702 


Minus 


406002 


8247797 


Minus 


406016 


8272661 


Plus


406097 


7107918 


Minus 


406104 


9124028 


Plus


406156 


7144867 


Plus


406207 


5923650 


Minus 


406300


6479046 


Minus 


406308 


9211532 


Plus


406314 


9211609 


Minus 


406317 


9211652 


Plus


406432 


9256504 


Plus


406490 


7711309 


Minus 


406584 


3983530 


Minus 



195808-196863 

37346-37633

44058-44803 

58962-59294 

123298-124035 

15758-15930 

26987-27778 

175426-175667 

143456-143626,143808-143935

158189-158433 

141829-142006 

162877-163118 

7999040237 

76723-77027,79317-79484

54089-54163,55427-55623 

19288-20076 

95177-95435 

4156541881 

125117-125287 

31409-31674 

70823-70990 

64099-64260

105655-106050 

4241042544,83317-83540,8684046922,87970-881 10 

12450-12753 

34972-35182 

95602-95969 

113738-113858 

104545-104757 

101165-102597 

109718-109847,109927-110202 

97326-97808

171477-172316 

77030-77280 

46107-46439 

51086-51301 

153339-153481,155099-155294 

127170-127358 

31352-31498 

80430-81581 

59449-60477 

240588-241589 

22310-23269 

119832-120016,124110-124275 

49330-49498 

142358-142546 

158959-159423 

102529-102633 

42873-43056,4381543949 

117025-117170,118567-118736 

106184-106313 

4964449760 

70284-70518 

12525-12997 

53624-53759 

5798-5914 

50700-50842 

138342-139051 

114896-115831 

78865-79664 

37424-38045 

33048-33856

5143-5684 

154007-154579 

4134141940 

35698-37269 

35309-35977 

379-597 

162607-162800 

19234-19401 

358408-358651 

12899-13011,18022-18136 

108018-108410 

3804-3930,40264120.49294109 

B029540480 

39894497 
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TABLE 9A: Genes predictive of no bladder cancer progression ' 

Pkey: Unique Eos probeset identifier number

ExAccn: Exemplar Accession number, Genbank accession number

UnigeneID: Unigene number

Unigene Title: Unigene gene title

R1: 80th percentile of Ta or T1 tumor AIs from patients who did not upstage divided by the 80th percentile of Ta or T1 tumor AIs from patients who did upstage
R2: median of Ta or T1 tumor AIs from patients who did not upstage divided by the median of Ta or T1 tumor AIs from patients who upstaged
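
The R1 and R2 columns are ratios of summary statistics between two patient groups. The sketch below illustrates that arithmetic under stated assumptions: the AIs are taken to be per-sample intensity values for one probeset, the function names `percentile` and `progression_ratios` are hypothetical, and linear interpolation between ranks is used for the 80th percentile because the legend does not say which percentile convention was applied.

```python
from statistics import median


def percentile(values, q):
    """Percentile q (0-100) of a non-empty sequence, using linear
    interpolation between closest ranks (an assumed convention)."""
    ordered = sorted(values)
    if not ordered:
        raise ValueError("values must be non-empty")
    rank = (len(ordered) - 1) * q / 100.0
    lower = int(rank)
    upper = min(lower + 1, len(ordered) - 1)
    frac = rank - lower
    return ordered[lower] + (ordered[upper] - ordered[lower]) * frac


def progression_ratios(no_upstage_ais, upstage_ais):
    """Return (R1, R2) for one probeset.

    R1: 80th percentile of the non-upstaging group divided by the
        80th percentile of the upstaging group.
    R2: median of the non-upstaging group divided by the median of
        the upstaging group.
    """
    r1 = percentile(no_upstage_ais, 80) / percentile(upstage_ais, 80)
    r2 = median(no_upstage_ais) / median(upstage_ais)
    return r1, r2


if __name__ == "__main__":
    # Made-up intensities, for illustration only.
    r1, r2 = progression_ratios([10, 20, 30, 40, 50], [5, 10, 15, 20, 25])
    print(round(r1, 2), round(r2, 2))
```

Under this reading, a ratio above 1 means the probeset signal is higher in tumors that did not upstage, which matches the table title.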



Pkey 


ExAccn 


Unigene ID 


Unigene Title 


R1 


R2 


408000 


L11690


Hs.198689


bullous pemphigoid antigen 1 (230/240kD)


4.64 


5.86 


412129 


M21984 


Hs.73454 


troponin T3, skeletal, fast 


4.54 


5.10 


459290 


NM_001546


Hs.34853 


inhibitor of DNA binding 4, dominant neg 


4.37


1.63 


400844 






NM_003105*:Homo sapiens sortilin-related


3.69 


5.90 


419555 


AA244416 




gb:nc07d11.s1 NCI_CGAP_PM Homo sapiens 


3.61 


2.03 


414522 


AW518944 


Hs.76325 


step II splicing factor SLU7 


3.60 


1.00 


440509 


BE410132 


Hs.134202


ESTs, Weakly similar to T17279 hypotheti 


3.58 


1.04 


445162 


AW189787 




ESTs 


3.57 


270 


407151 


H25836 


Hs.301527 


ESTs, Moderately similar to unknown [H.s 


3.56 


1.48 


421314 


BE440002 


Hs.180324


Homo sapiens, clone IMAGE:4183312, mRNA,


3.28 


325 


429663 


M68874 


Hs.211587 


phospholipase A2, group IVA (cytosolic


3.25 


250 


430702 


U56979 


Hs.278568 


H factor 1 (complement)


3.20 


270 


412420 


AL035668 


Hs.73853 


bone morphogenetic protein 2 


3.20 


230 


420729 


AW964897 


Hs.290825 


ESTs 


3.20


1.53 


433376 


AI249361 


Hs.74122


caspase 4, apoptosis-related cysteine pr 


3.00 


4.10


420028 


AB014680 


Hs.8786 


carbohydrate (N-acetylglucosamine-6-O) s


2.94 


278 


407881 


AW072003


Hs.40968 


heparan sulfate (glucosamine) 3-O-sulfot 


2.93 


1.43 


426283 


NM_003937


Hs.169139 


kynureninase (L-kynurenine hydrolase) 


2.93 


1.33 


428030 


A19T5228 


Hs.11493 


Homo sapiens cDNA FLJ13536 fis, clone PL


2.92 


247 


419713 


AW968058 


Hs.92381


nudix (nucleoside diphosphate linked moi


2.89 


3.33 


414407 


AA147026 


Hs.76704 


ESTs 


2.87 


2.87 


450779 


AW204145 


Hs.156044


ESTs 


2.78 


1.86 


411243 


AB039886 


Hs.69319 


CA11 


2.73 


1.00 


417878 


090916 


Hs.82845 


Homo sapiens cDNA: FLJ21930 fis, clone H


2.68 


232 


441619 


NM_014056 


Hs.7917 


DKF2P554K247 protein 


2.67 


298 


446619 


AU076643 


Hs.313 


secreted phosphoprotein 1 (osteopontin,


2.66 


1.00 


440006 


AK000517 


Hs.6844 


hypothetical protein FLJ20510


2.64 


1.77 


426252 


BE176980


Hs.28917 


ESTs 


2.63 


7.30 


424008 


R02740 


Hs.137555 


putative chemokine receptor; GTP-binding


2.60 


253 


429429 


AA829725 


Hs.334437 


hypothetical protein MGC4248 


259 


3.34 


427450 


AB014526 


Hs.178121 


KIAA0626 gene product 


2.57 


228 


420180 


AI004035 


Hs.25191 


ESTs 


2.56 


1.68 


434061 


AW024973 


Hs.283675 


NPO009 protein 


2.54 


2.10 


422070 


AF149785 


Hs.111126 


pituitary tumor-transforming 1 interact 


2.54 


3.25 


419355 


AA428520 


Hs.90061 


progesterone binding protein 


2.53 


3.63 


446215 


AW821329 


Hs.14368 


SH3 domain binding glutamic acid-rich pr 


2.52 


4.38 


432442 


AI672516


Hs.178485 


ESTs, Weakly similar to S65657 alpha-1C-


250 


4.60 


447887


AA114050


Hs.19949 


caspase 8, apoptosis-related cysteine pr


249 


2.23 


401155 






Target Exon 


2.48 


230 


404530 






Target Exon 


248 


1.00 


446006 


NM_004403


Hs.13530 


deafness, autosomal dominant 5 


2.48 


235 


436476 


AA326108 


Hs.33829


bHLH protein DEC2


2.47 


288 


446535 


AF257175 


Hs.15250


peroxisomal D3,D2-enoyl-CoA isomerase


2.46 


2.19 


408636 


BE294925 


Hs.46680 


CGM2 protein


2.45 


1.60 


420962 


NM_005904


Hs.100602


MAD (mothers against decapentaplegic, Dr


2.44 


3.75 


427008 


Z45258 


Hs.286013 


short coiled-coil protein 


2.42 


340 


459711 


BE386801 


Hs.21858 


trinucleotide repeat containing 3 


2.40 


278 


407910 


AA650274 


Hs.41296 


fibronectin leucine rich transmembrane p 


2.40 


1.00 


410337 


M83822 


Hs.62354 


cell division cycle 4-like 


2.39 


3.68 


435029 


AF167706 


Hs.19280 


cysteine-rich motor neuron 1


2.39 


3.23 


437181 


AI306515 


Hs.125343 


ESTs, Weakly similar to KIAA0758 protein 


239 


1.00 


410968 


AA1S9907 


Hs.67397 


homeo box A1 


238 


1.33 


422511 


AU076442 


Hs.117938


collagen, type XVII, alpha 1 


23B 


6.40 


450775 


AA902384 


Hs.73853 


bone morphogenetic protein 2 


2.38 


271 


442433 


BE243044 


Hs.8309 


KIAA0747 protein 


237 


3.68 


454000 


AA040620 


Hs.5672 


hypothetical protein AF140225 


2.36 


1.14 


447701 


BE619526 


Hs.255527 


hypothetical protein MGC14128 


236 


202 


427985 


AI770170 


Hs.29643 


Homo sapiens cDNA FLJ13103 fis, clone NT


236 


218 


442257 


AW503831 


Hs.323370 


Human EST clone 25267 mariner transposon 


235 


3.53 


454070 


N79110 


Hs.21276 


collagen, type IV, alpha 3 (Goodpasture 


235 


3.60 


418452 


BE379749 


Hs.65201 


C-type (calcium dependent, carbohydrate- 


235 


248 


421218


NM_000499


Hs.72912 


cytochrome P450, subfamily 1 (aromatic c 


235 


1.00 


407793 


AW080879 


Hs.236572 


gb:xc38g04.x1 NCI_CGAP_Co20 Homo sapiens


235 


1.21 


442061 


AA774284 


Hs.285728 


abl-interactor 12 (SH3-containing protei


234 


3.03 


402845 






ENSP00000246267:KIAA0444 PROTEIN (FRA6ME 


234 


1.52 


411407 


R00903 


Hs.169793 


ribosomal protein L32


234 


0.77 


418506 


AA084248 


Hs.85339 


G protein-coupled receptor 39 


234 


0.72 
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5 



424637 

413804 

411060 

430028 

417720 

436396 

454219 

444745 

408179 

442679 

458949 

407191 

448367 

405155 

445594 

417458 

430315 

411945 

408937 

431474 

434094 

420997 

420164 

414099 

424800 

459005 

416290 

439208 

401563 

404687 

443303 

439866 

400835 

456655 

431689 

418026 

451131 

406038 

434078 

441623 

459244 

424720 

404204 

453987 

411400 

454949 

409223 

418030 

433364 

459511 

437559 

418827 

417470 

421012 

448772 

439601 

434417 

424865 

400752 

438916 

430024 

409345 

421939 

442315 

419591 

458025 

428582 

422749 

433091 

456421 

421508 

402760 

406274 

406897 

409632 

445320 

442271 



NMJ315057 

T64682 

NM_006074

BE564110 

AA205625 

AI683487

X75042 

AF117754 

AL042465 

R53718 

AW291777 

AA608751 

AI955411

AW058463 

NM_005655

NM_004293

AL033527 

AA210734 

AI133990 

AA305599 

AK001214 

AW339037 

U11313

AL035588 

AA447679 

NM_000016

AK000299 



U67319 

AA280717 

AW853954 

AF035528 

AA305688 

BE379727 

AI267586 

Y14443 

AW880709 

AA315805 

AW503990 



AA323750 

M311919 

AW847318 

AA312572 

BE207573 

AI075407 

AI142379

AI678033 

BE327311 

AF112219 

X53281 

AW390822 

AB029032 

AL110157 

AF011333 

AW188464 



AI949109 

BE169531 

AA173992 

AF090900 

AI275406 

BE336699 

W01076 

Y12642 

AL157485 

NM_004833



M57417 
W74001 
AA503887 
AF000852



Hs.151411 

■Hs.3m01 
Hs^27750 
Hs.208067 
Hs.152213 
Hs.44313 
Hs.11861 
Hs.43445 
Hs.107882 
Hs.346137 

Hs.94109 

Hs.12940 
Hs.82173 
Hs.239147 
Hs.92137 
Hs.291386 
Hs.190642 
Hs.238205 
Hs.100914 
Hs.24908 
Hs.75760 
Hs.153203
Hs.144558 
Hs.79158 
Hs.180952 



KIAA0916 protein

gb:yc48b02.r1 Stratagene liver (937224)
Homo sapiens mRNA full length insert cDN
Target CAT
ESTs

wingless-type MMTV integration site fami
v-rel avian reticuloendotheliosis viral
thyroid hormone receptor-associated pro
poly(A)-specific ribonuclease (deadenyla



Hs.9216 
Hs.6727 

Hs.153863
Hs.267695 
Hs.83213 
Hs.268012 

Hs.283683 

Hs.142442 
Hs.152292 

Hs.235025 

Hs£9851 

Hs.290131 

Hs.6241 

Hs.83321 

Hs.296083 

Hs.121476 

Hs.47166 

Hs.82193 

Hs.101025 

Hs.301528 

Hs.6606

Hs.3843 

Hs.153563 

Hs.101515 
Hs.227730 

Hs.109727 

Hs.7956 

Hs.91393 

Hs.32450 

Hs.185055 

Hs.278573 

Hs.3185 

Hs.91973 

Hs.105115 



Hs.55279 
Hs.167011 
Hs.8180 



ESTs, Weakly similar to T08599 probable
gb:ae56h07.s1 Stratagene lung carcinoma
Homo sapiens cDNA FLJ13634 fis, clone PL
Target Exon

zinc-fingers and homeo boxes 1
TGFB inducible early growth response
guanine deaminase

v-myc avian myelocytomatosis viral oncog
ESTs

CEGP1 protein
hypothetical protein PRO2013
hypothetical protein FLJ10352
ESTs

sterol carrier protein 2
MyoD family inhibitor

ESTs, Weakly similar to ALU1_HUMAN ALU S
acyl-Coenzyme A dehydrogenase, C-4 to C-
dynactin 4 (p62)
C15001262:gi|7304981|ref|NP_038528.1| ca
C9000375*:gi|11994617|dbj|BAB02754.1| (A
caspase 7, apoptosis-related cysteine pr
Ras-GTPase activating protein SH3 domain
chromosome 2 open reading frame 2
MAD (mothers against decapentaplegic, Dr
UDP-Gal:betaGlcNAc beta 1,3-galactosyltr
fatty acid binding protein 4, adipocyte
fatty-acid-Coenzyme A ligase, long-chain
zinc finger protein 200
chromosome 8 open reading frame 4
desmoglein 2
HP1-BP74

SWI/SNF related, matrix associated, acti
ENSP00000252204*:Zinc finger protein 165
Homo sapiens, clone IMAGE:4247529, mRNA,
nucleolar protein family A, member 1 (H/
KIAA1819 protein

phosphoinositide-3-kinase, regulatory su
neuromedin B

ESTs, Moderately similar to 154374 gene
gb:qg64c01.r1 Soares_testis_NHT Homo sap
ESTs
HT021

esterase D/formylglutathione hydrolase
basic transcription factor 3
L-kynurenine/alpha-aminoadipate aminotra
KIAA1109 protein

Homo sapiens mRNA; cDNA DKFZp586F2224 (f
lymphocyte antigen 75
NM_003105*:Homo sapiens sortilin-related
ESTs

integrin, alpha 6
hypothetical protein FLJ20783
TAK1-binding protein 2; KIAA0733 protein
ESTs, Moderately similar to ZN91_HUMAN Z
Homo sapiens cDNA: FLJ21887 fis, clone H
gb:ql63c10.x1 Soares_NhHMPu_S1 Homo sapi
BENE protein

CD59 antigen p18-20 (antigen identified
lymphocyte antigen 6 complex, locus D
hypothetical protein
absent in melanoma 2

NM_02179r:Homo sapiens eosinophil chemo
Target Exon

gb:Homo sapiens mucin (mucin) mRNA, part
serine (or cysteine) proteinase inhibito
Homo sapiens cDNA: FLJ21352 fis, clone C
syndecan binding protein (syntenin)

199 



2.32 

232 

232 

232 

2.32 

231 

130 

2.30 

2.29 

229 

228 

227 

227 

226 

226 

2.25 

224 

224 

224 

223 

222 

222 

222 

221 

221 

221 

220 

220 

220 

219 

219 

219 

218 

216 

217 

217 

217 

217 

217 

2.17 

217 

217 

2.17 

2.16 

2.16 

2.16 

216 

2.16 

216 

216 

2.15 

2.15 

2.15 

2.15 

2.15 

215 

2.15 

215 

214 

214 

214 

ZU 

213 

2.13 

213 

212 

212 

212 

211 

211 

211 

209 

209 

209 

209 

209 

209 



255 
1.46 
290 
228 
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446493 
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414895 


AW894856 


442072 
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425723 
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432501 


A1554929 



Hs.7395 

Hs.130650 

Hs.78465 

Hs.133487 

Hs.108920 

Hs.109701 

Hs.194477 

Hs.119140 

Hs.75083 

Hs.26fi122 



Hs.205173 

Hs.7890 

Hs.89901 

Hs.161031 

Hs.30212 

Hs.189999 

Hs.173705 

Hs.184544

Hs.251754 

Hs.35096 

Hs.42959 

Hs.26756 

Hs.25566 

Hs.221441 
Hs.166196 

Hs.575 

Hs.300697 

Hs.301404 

Hs.261734 

Hs.3903 



Hs.79067 
Hs.20010 

Hs.288361 
Hs.63325 
Hs.276712 

Hs.301914 

Hs.20993 

HS.1082IQ 

Hs.98135 

Hs.22891 

Hs.111862

Hs.83916 

Hs.130881 

Hs.920 

Hs.107882 

Hs.76888 

Hs.173328 

Hs.26192 

Hs.97726 

Hs.145409 

HS.1326B 

Hs.35660 

Hs.31731 

Hs.65450 

Hs.104991 

Hs.121924 

Hs.2642 

Hs.5548 

Hs.129053 

Hs.15144 

Hs.116278

Hs.12311 

Hs.159311 

Hs.281866 



hypothetical protein FLJ23182
ESTs

v-jun avian sarcoma virus 17 oncogene ho
ESTs

HT018 protein

ubiquitin-like 5

E3 ubiquitin ligase SMURF2

eukaryotic translation initiation factor

ENSP00000251675*:KIAA1550 protein (Fragm

human immunodeficiency virus type I enha

Target Exon

hypothetical protein

ESTs

ESTs

Homo sapiens mRNA for KIAA1671 protein,
phosphodiesterase 4A, cAMP-specific (dun
Homo sapiens mRNA; cDNA DKFZp434K0322 (f
thyroid receptor interacting protein 15
purinergic receptor (family A group 5)
Homo sapiens cDNA: FLJ22050 fis, clone H
Homo sapiens, clone IMAGE:3355383, mRNA,
secretory leukocyte protease inhibitor (
KIAA1538 protein
KIAA1012 protein
hypothetical protein FLJ20896
ESTs, Weakly similar to 2004399A chromos
C7000789:gi|1943947|gb|AAC48716.1| (U901
ESTs

ATPase, Class I, type 8B, member 1
C15000305:gi|3806122|gb|AAC69198.1| (AF0
aldehyde dehydrogenase 3 family, member
coagulation factor VII (serum prothrombi
RNA binding motif protein 3
Homo sapiens cDNA: FLJ22807 fis, clone K
Cdc42 effector protein 4; binder of Rho
U5 snRNP-specific protein, 116 kD
chromosome 21 open reading frame 15
C7000789:gi|1943947|gb|AAC48716.1| (U901
MAD (mothers against decapentaplegic, Dr
ESTs

gb:CMO-HT0395-280100-169-b09 HT0395 Homo
Homo sapiens cDNA: FLJ22696 fis, clone H
transmembrane protease, serine 4
Homo sapiens cDNA FLJ11074 fis, clone PL
NM_007037*:Homo sapiens a disintegrin-li
gb:RC6-BT0627-220300-O12-O06 BT0627 Homo
high-glucose-regulated protein 8
hypothetical protein FLJ10813
hypothetical protein FLJ20559
solute carrier family 7 (cationic amino
KIAA0590 gene product
ESTs

NM_005000*:Homo sapiens NADH dehydrogena
B-cell CLL/lymphoma 11A (zinc finger pro
ESTs

hypothetical protein FLJ10659
hypothetical protein MGC12702
protein phosphatase 2, regulatory subuni
ESTs, Weakly similar to ALU1_HUMAN ALU S
ESTs

RAB, member of RAS oncogene family-like

C2000428*:gi|7705383|ref|NP_057536.1| GC
BUP protein
peroxiredoxin 5
reticulon 4

hypothetical protein similar to preferen
ESTs

eukaryotic translation elongation factor
F-box and leucine-rich repeat protein 5
Target Exon

Homo sapiens NOTCH 1 (N1) mRNA, complete
hypothetical protein DKFZp564O043
Homo sapiens cDNA FLJ13671 fis, clone PL
Homo sapiens clone 23570 mRNA sequence
dickkopf (Xenopus laevis) homolog 4
ATPase, H+ transporting, lysosomal (vacuo

205 



1.51 


Z50 


1.51 


1.64 


1.51 


2.62 


1.51 


2.02 


1.51 


2.22 


1.51 


1.34 


1.51 


2.24 


1.51 


1.49 


1 51 

1.9} 


O 1ft 


1.51 


1.33 


1.51 


3.22 


1.51 


1.57 


1.51 


1.59 


1.50 


2.70 


1.50 


3.53 


1.50 


2.24 


1.50 


1.13 


1.50 


1.62 


1.50 


1.00 


1.50 


1.05 


1.50 


2.53 


1.49 


0.97 


1.49 


1.44 


1.49 


2.24 


1.49 


2.40 


1.49 


2.84 


1.48 


2.20 


1.48 




1.48 


2.24 


1.48 


1 4ft 


1.48 


1.16 


1.48 


1.20 


1.48 


2.38 


1.48 


1 9ft 


1.48 


9 71 


1.48 


9 W 


1.48 


9 CV\ 


1.48 


9 nn 


1.48 


1 9R 
I. ZD 


1.48 


9 1% 


1.48 


9 nn 

Z.UQ 


1.48 


1.00 


1 47 ' 


i Q1 

1.3 I 


1.47 


9 9ft 
Z.ZD 


1.47 


1 94 


1.47 


2.00 


1.47 


9 on 


1.47 


4.08 


1.47 


0. 10 


1.47 


1.06 


1.47 


1 51 


1.47 


2.39 


1.47 


1.00 


1.46 


9 9ft 


1.46 


2.13 


1.46 


1.98 


1.46 


1 45 


1.46 


2.31 


1.46 


1.30 


1.46 


2.48 


1.46 


1.05 


1.46 


2.12


1.46 


Z40 


1.46 


1.65 


1.46 


1.56 


1.46 


1.49 


1.46 


2.45 


1.46 


2.00 


1.45 


1.60 


1.45 


1.28 


1.45 


2.21 


1.45 


2.40 


1.45 


3.65 


1.44 


2.71 


1.44 


1.08 


1.44 


2.24 


1.44 


1.63 



WO 03/003906 



PCT/US02/21338 






412210 


AWQ014Q9 




421685 


AF1RQ791 

f\r 103 1 £.0 


He 10R77R 


428115 


AB023194 


Me OrtflOer 


442358 


RP5R7QRR 

□ COO/ 300 


ns. 10000 


414685 


LuaO/4 


ns./oow 


413798 


AA11R70R 


ns./oo^o 


410937 


nJ\c I030H 


Mc R7019 
n5.0f UDZ 


400397 


Al 97077ft 




405902 






433976 


AAR90QR7 




405376 






HOOUOQ 


£.40100 


Me OOR1 

ns.yyoi 


418182 


AW01R401 


Mc 1RR4R 
no. 1 OO'+O 


41(1107 




Me OODQyM 

nS.ZJoy44 


434924 


A 44411 R4 


Me OOOCQ 


Hi IOC \ 


DCZ4D l**3 


Mc R9R40 
nS.0Z040 


404744 






405418 






402869 






451608 


AAlfl4R9 t » 


Mc 9R74R 


424099 


AP071909 


Mc 1 1Q11R 


401041 






417839 


' AlR1*J719 
nIOlOf OZ 


Mc R9719 

ns.oz/ iz 


409245 


AA1RM17 


Mc 9RR01A 
flb.ZOOv/OO 


HHf OUO 


MM 0079RR 
IN [V1_UU / ZOO 


Me 1QR70 

ns.iyoi O 


45R4Q9 


A A 110047 


Mc 1Q11A7 


44Q944 


A\AiflCQQ7Q 


HS.OZZU4 


41WQ4 


U941R4 

nz** i o*» 


Me OK/110 
nS,Z0*+ \i 


HOZWf 


A AR09QOQ 


U c OO0R0 


407R74 


mVUO'lUO I 


ns.z/y 140 


4419Q7 


AUW/ffiOflfl/t 
AWHUOUO*} 


ns./ f 00 


491 Q19 
HZ I gOZ 


WO I f / O 


Mc 191Q4Q 


49R14R 
**ZOO**0 


QC4RRRRR 
□t^DDOOD 


Mc 17411 


40 ZOO? 


A147QR1 1 

Hi^/yo jo 


Mc 97RA11 
nS.Z/04l 1 


411 71R 


A\A/Q7779A 


Uc 

ns./oyoo 


4Z3300 


MM 00417R 


Uc 99R1R1 
nS.ZZOOu 1 


444017 


AVA47RRR 
MVO'+fOOQ 


He 49711 
ns.4Zf 00 


4091 44 

HVZ I It 








AA19C17n 


Uc 0OAR07 
rlS.ZZ'JoZ/ 


419179 


RPRRR141 


He 9RQRR 


rococo 


Al IT7CC7 
ALIO/ 09/ 


Mc 179RA1 
MS. 1 /Z0U0 


410664 


AA083500 


Hs.170298 


401784 








btz4z/ ry 


ns.i/yozo 




A1CCACQO 

Aloo4ooz 


nS.i/oo/U 


4Uo4D/ 






444oo9 


TQCCCC 


Up 01 CM 

nS.olooz 


401D00 


AJUZf04J 


Uc lonon 


viioodo 


Bboyzuzb 


nS.oo4o40 


447537 


Avvzyou/z 


Up oxc/no 
nS.04b4Uo 


42821 1 


A AJIOA01 1 

AA4z4zl i 


Up 1 001 "Id 

MS. 1 00 I/O 


406248 






437412 


BcDd92Bo 


Hs.34744 


414653 


M24480 


n. TC7CQ 

HsJo7ofl 


403885 
439459 


AFuooZ79 


II- CfiAlO 

Hs.ooOio 


419075 


T842DD 


HS.1 23927 


4050Z2 






AftllAG 

4Urd4o 


BcU4145l 




415660 


AJ9U9UU7 


U«« 70 CCO 

HsJoodo 


448023 


Alo932yy 


HS. 170000 


435962 


A ATflOGO/1 


H. 004 00.4 

HS. 29 1294 


432480 


AA205475 


Hs.275855 


414109 
H I40U3 


AKOOOfilQ 


He 7^RR4 
no. / 000^ 


440256 


U23841 


Hs.18851 


413809 


U5851 


Hs.851 


408176 


AK001553 


Hs.43436 


433960 


AW629188 


Hs.188929 


404178 






402449 






455604 


BE011183 




429221 


AJ821060 


Hs.198271 


422122 


AA383642 


Hs.1 11894 


406231 






405879 






450936 
gb:RC0-NN1012-270300-031-h10 NN1012 Homo
ATPase, Ca transporting, type 2C, member
KIAA0977 protein

ESTs, Moderately similar to ALU4_HUMAN A
dCMP deaminase

capping protein (actin filament) muscle
vacuolar protein sorting 26 (yeast homol
transcription factor 7-like 2 (T-cell sp
Target Exon 
ESTs 

Target Exon 

Homo sapiens cDNA: FLJ21954 fis, clone H
ESTs

hypothetical protein FLJ10631
hypothetical protein FLJ13433
protein tyrosine kinase 9 
Target Exon 
Target Exon 
Target Exon 
hypothetical protein 

ATP-binding cassette, sub-family C (CFTR 
C11000425:gi|4507721|ref|NP_003310.1| U
fragile X mental retardation, autosomal 
tRNA isopentenylpyrophosphate transferas 
suppressor of S. cerevisiae gcr2 
ESTs 
ESTs 

TOLUP protein 

brain-specific protein p25 alpha 
ESTs 

ubiquitin-conjugating enzyme E2E 1 (homo 
kangai 1 (suppression of tumorigenicity 
hypothetical protein FLJ20967
NCK-associated protein 1
thymosin, beta 4, X chromosome
COX15 (yeast) homolog, cytochrome c oxid
CHMP1.5 protein 
Target Exon 

ESTs, Weakly similar to FAHUAA alpha-act 
glutaredoxin (thioltransferase) 
hypothetical protein MGC10327 
ESTs 

NM_002280*:Homo sapiens keratin, hair, a 
upregulated by 1,25-dihydroxyvitamin D-3
ESTs

Target Exon
ESTs
ESTs

hypothetical protein MGC13045

ESTs, Weakly similar to AF193556 1 sacsi

ESTs 

Target Exon 

Homo sapiens mRNA; cDNA DKFZp547C136 (fr

procollagen-proline, 2-oxoglutarate 4-di 

Target Exon 

ESTs 

ESTs 

Target Exon 
hypothetical protein 

ubiquitin-conjugating enzyme E2G 1 (homo 

ESTs 

ESTs 

ribosomal protein S18
DKFZP586A011 protein
hypothetical protein FLJ10875
integrin, alpha E (antigen CD103, human
adenylate kinase 3 alpha like
ESTs 

C6XX)1430 , ^i|4503521|ref|NP - .001559.1| mu 
Target Exon 

gb:PM3-BN0216^1(J0500-003^09 BN0218 Homo 
Target CAT 

lysosomal-associated protein transmembra 
Target Exon 
Target Exon 

gb:ow23a10jc1 Soares_parathyroid_tumor w N 
ENSP00000231844*:Ecotropic virus Integra 




1.44 


115 


1.44 


1.83 


1.44 


1.31 


1.44 


247 


1.44 


1.25 


1.44 


1.26 


1.44 


1.41 


1.44 


3.43 


1.44 


2.65 


1.44 


2.46 


1.44 


2.28 


1.44 


1.34 


1.44 


2.35 


1.43 


1.55 


1.43 


2.05 


1.43 


2.15 


1.43 


1.99 


1.43 


2.83 


1.43 


2.40 


1.43 


1.22 


1.43 


Z45 


1.43 


2.90 


1.43 


2.84 


1.43 


2.65 


1.43 


. 200 


1.43 


2.73 


1.42 


1.57 


1.42 


1.33 


1.42 


2.50 


1.42 


2.35 


1.42 


2.20 


1.42 


1.48 


1.42 


1.83 


1.42 


246 


1.42 


1.30 


1.42 


1.50 


1.42 


1.38 


1.42 


2.38 


1.42 


2.23 


1.42 


2.18 


1.41 


1.38 


1.41 


1.28 


1.41 


1.37 


1.41 


1.32 


1.41 


1.37 


1.41 


1.80 


1.41 


2.94 


1.41 


1.41 


1.41 


1.21 


1.41 


207 


1.41 


1.25 


1.41 


240 


1.41 


1.39 


1.41 


1.33 


1.41 


2.58 


1.41 


208 


1.41 


284 


1.40 


255 


1.40 


2.38 


1.40 


238 


1.40 


238 


1.40 


210 


1.40 


1.37 


1.40 


1.18 


1.40 


1.91 


1.40 


280 


1.40 


2.73 


1.40 


1.99 


1.40 


2.83 


1.40 


1.51 


1.40 


2.30 


1.40 


1.22 


1.40 


1.42 


1.40 


2.60 


1.40 


273 


1.40 


1.13 


1.39 


6.03 






453258 
448261 
427656 






407704 

430138 

432841 

444051 

440704 

450092 

400275 

403725 

443211 

421510 

430071 

451545 

439897 

423872 

410344 

404439 

448581 

408559 

447643 

401593 



AW293134 

BE244072 

AI791495 

AW992356 

BE315072 

AA936296 

M93425 

N4B373 

M69241 

AW139S06 



AI128388 

AK0QC919 

AA355986 

A1802128 

NM-015310 

AB020316 

AW978435 

NM_002709 

BE066047 

H10767 



Ancicc 
406356 


N47812 


401886 




A'iA HA 

421 1 10 


AJ250717 


427449 


AW946384 


427451 


AI690916 


440681 


AW449596 


419590 


AF005043 


446044 


H67567 


400967 




414506 


AF075337 


402599 




422932 


AI191813 


433889 


AK002082 


429802 


H09548 


443856 


AK0Q0574 


453489 


AA300067 


424670 


W61215 


428995 


AW004975 


441551 


AA318224 


450528 


NMJM4072 


427605 


NMJ00997 


459237 


AA031675 


413691 


AB023173 


404906 




436246 


AW450963 


441478 


AA350018 


419715 


AF070523 


426251 


M24283 


400129 




450447 


Af 21 2223 


434697 


AL133033 


430308 


BE540865 


434767 


AF153201 


459729 


AL037285 


426653 


AA530892 


408912 


AB011084 


409844 


AW502336 


402517 




447042 


AB035863 


405000 




452065 


AK000360 


404666 




451081 


A1078645 


427979 


BE379776 


435825 


R16702 


426469 


BE297886 


447002 


BE242866 


410946 


AW811502 


454383 


AW500332 


440512 


AAB87845 


409865 


AW502208 


447390 


X95384 



Hs.32597 

Hs^0815 

Hs.180142 

Hs.8364 

Hs.78768 

Hs.234285 

Hs.62 

Hs.10247 

Hs.162 

Hs.221057 



Hs.143655 

Hs.f05f91 

Hs.232068 

Hs.208647 

Hs.6763 

Hs.134015 

Hs.62515 

Hs.21537 
Hs.86412 
Hs.238465 



Hs.1355 

Hs.178112 

Hs.178137 

Hs.166547 

Hs.91390 

Hs.13572 

Hs.76293 

Hs.308220 

Hs.3623 

Hs.5367 

Hs.9908 

Hs.33032 

Hs.1 16651 

Hs.194716 

Hs.296141 

Hs.25063 

Hs.337445 

Hs.31917 

Hs.75478 

Hs.119991 
Hs.301342 
Hs.92384 
Hs.168383 

Hs.25010 

Hs.4084 

Hs.238990 

Hs.289848 
Hs.171695 
Hs.48924 
DKFZP586G011 protein
protein tyrosine phosphatase, non-recept
activated leucocyte cell adhesion molecu
insulin-like growth factor binding prote
ESTs, Weakly similar to ALU1_HUMAN ALU S
NM_006513*:Homo sapiens seryl-tRNA synth
Target Exon
ESTs

hypothetical protein FLJ10057
transcription factor 8 (represses interl
ESTs

KIAA0942 protein
uronyl 2-sulfotransferase
KIAA0494 gene product
ENSP00000067222*:Mitochondrial 28S ribos
protein phosphatase 1, catalytic subunit
chromosome 9 open reading frame 5
gb:Human HMGI-C chimeric transcript mRNA
Adlican
Homo sapiens mRNA full length insert cDN
eukaryotic translation initiation factor
chromosome 11 open reading frame 2
ESTs

eukaryotic translation initiation factor
thyroid hormone receptor-associated prot
insulin-degrading enzyme
zinc finger protein 208
ESTs
Target Exon
Target Exon

splicing factor, arginine/serine-rich 11
leukocyte receptor cluster (LRC) encoded
ESTs

chorionic somatomammotropin hormone 1 (p
ESTs
Snf2-related CBP activator protein
ESTs

ESTs, Weakly similar to BOX B BINDING FA
ESTs
insulin-like growth factor 1 receptor
ESTs
NM_014587*:Homo sapiens SRY (sex determi
Eos Control
ESTs
NM_021733*:Homo sapiens testis-specific
Homo sapiens mRNA; cDNA DKFZp564K1972 (f
peroxisomal biogenesis factor 6
fibroblast growth factor receptor 2 (bac
Target Exon
matrix metalloproteinase 28
frequently rearranged in advanced T-cell
ESTs, Weakly similar to MGB4_HUMAN MELAN
hypothetical protein FLJ12577
hypothetical protein DKFZp434P0531
cytochrome c oxidase subunit Vb
FK506-binding protein 2 (13kD)
hypothetical protein MGC11352
Hs.184456
Hs.158992
Hs.194382
Hs.301866
Pkey: Unique Eos probeset identifier number
CAT number: Gene cluster number
Accession: Genbank accession numbers
TABLE 9C



Pkey: Unique number corresponding to an Eos probeset

Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (GI) numbers. "Dunham I. et al." refers to the publication entitled "The DNA sequence of human chromosome 22." Dunham I. et al., Nature (1999) 402:489-495.
Strand: Indicates DNA strand from which exons were predicted.
Nt_position: Indicates nucleotide positions of predicted exons.



Pkey    Ref      Strand  Nt_position
400634  8567750  Minus   101102-101223,101886-102018
400750  8119067  Plus    198991-199168,199316-199548
400752  7331445  Minus   36215-36451
400772  8131629  Minus   34896-35021,41078-41197
400773  8131629  Minus   44116-44238,48208-48321
400788  7342055  Plus    184369-184715
400789  8307741  Plus    82281-83693
400835  8954121  Plus    89366-89622
400843  9188605  Plus    5863-5970,7653-7784,8892-9023,9673-9807,10634-10789,15254-15403,23827-23958
400844  9188605  Plus    24746-24872,25035-25204
400845  9188605  Plus    34428-34612
400846  9188605  Plus    39310-39474
400847  9188605  Plus    44643-44835
400957  7705148  Minus   66959-67241
400967  7770682  Minus   32697-32999
401038  7232177  Minus   4277-4469
401041  7232177  Plus    44750-45076
401058  8117654  Minus   45226-45414
401155  9438289  Plus    31381-31526
401177  9438503  Minus   62773-63330
401196  9719673  Plus    33138-33834
401204  9743388  Minus   33694-33872
401286  9801342  Minus   147036-147318
401346  9926605  Minus   12031-13032
401371  9650602  Plus    80901-81283
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401383 


6721 1 35 


Mini )c 


1 RWA1_1 £71(11. 
133340-13/301 


401496 


71A17RQ 
/ oo i /oy 


Minus 


0O7OA Q1AAO * 

B2/8Q-8OU02 


401512 


7R9914R 


PlltC 

HUS 


100399-1 36557 


401563 


A947Qin 

QC'tl 3 IU 


Dine 

rlUS 


91395-91763 


401572 


03 f U£M 


Minus 


7oo5 1-78869 


401593 




Plus 


1A1CQ lAITJO < 4 00*3 4 00CC 

1U obo-lU57 1 1 293- 1 2356 


4U IOUU 


40DO7qb 


Minim 

Minus 


07*3C^ 17fC ■< O AflT'lT AflOAJ /V*#"1/* /Vrt-» , 4<J 

27363-Z751 8.287Z7-28891 ,29526-2973 1 


*4U I QUO 


7fiflQ0R1 

/ooyyoo 


Minus 


4.4CCCQ 4 4C7QO 

11b659-1 16780 


401613 




Dine 

rlUS 


224b1-Z2B31 


401654 


yu jf 1 qc 


Minus 


b4byD-o4/9/ 


HU 1 QQU 


y IUU004 


Minus 


i/Jbbi-l /4U^4 




O40O0DI 


rlUS 


770*5 QAKO 


*»U 1 f 0*4 


79AQ1QA. 

/.£43iyu 


Ptite 

rlUS 


1 4o Jbz-1 4oouo ( 1 49453- 1 49535, 1 49731 -1 49962 


HU 1 000 


711Q7AA 


nus 


*M99E7 1>107^0 
14«0/-14^i44 


W 1 uuu 


7229913 


Minus 


7091 IL7QOO*J 

rsi id- /yoyo 


401Q1R 

Hw 1 SOU 


oououy i 


Plus 


ARM7 ARQA1 
hqO 1 f-*K)y*W 


402001 


you 10 10 


PI IIC 

nus 


OaUO^-OObWcJ 


402087 


R117RAR 
Oil/ oho 


Plus 


1170RQ_1 17911 11B£7ft_11flfl , 9Q i 1flQfiO_1 lOHCA 

i j i uoy- 1 j / 1\ \ o ,1 ooo / o- 1 ooozo, i ooyoy-i osJou 


402138 


77n4QflR 
/ /unyoo 


Pius 


1*4 1 /O- 13 IUO 


402144 


794919R 


Plus 


11R49<L11RQ77 
1 I0HZ3-I 103/ ** 


402190 


03/ OUOf 


Minus 


7C^Q Q.7COC0 


402191 


R57fin71 

oo/cu/o 


Minus 


RGA 1 rLROCQI 
D341U-0330J 


in 9 too 


77GR79S 

/ / yo r 00 


Plus 


1CQ99 -IRIQn 1Q71C 00Cft7 OOCO/t 
13003- lDl0U,lo4l3-lO/l0,Z£0U/-'Z<db£4 


dtwRR 

HU4000 


yo joo / f 


Minus 


>*791fl /179in >lflAC9 /Q9A1 
4 » £ 1 0-4 f 0OU,4twd£-404U J 


4H9971 
4U10/ I 


3030304 


Plus 


bo Mo- bo yob 


/iA9i7Q 
4UZ3/0 


ybz3333 


Minus 


^414 0 >!4>tCO 4a < ]41 Jt070/1 

41 31 z-41 4oo,4o3 1 0-40/ 




9/ yob/ *» 


Plus 


COCC7 CAA1Q COCQQ COQOQ CO^CC COCOI CitQO'S CG4AQ 

oyob /-DuU33,bi;Doo-b/o*£o,bJ4b3-b3bti3,b^y^ 


4U£4/4 


7C/I71 7C 

/34/1 /3 


Mimic 

Minus 


COCOC C1COQ CC7CC CCOOA C7C0A CT7C7 

o3o^o-30ozo,oo / bo- ooyzu ,D / 530-3 1 (of 


4023 If 


yVyoiUo 


Pfus 


47CCQ 477*11 

17003-1/721 


4U2&yy 


7239565 


Dl.tr. 

PIUS 


CQOC CQQ7 

5o35-59o7 


40261/ 


Q01A707 

yyoUry/ 


Minus 


by4bb-b994t> 


/A9719 
4U2/1Z 


cyoyZoo 


Minus 


1U941-11130 


>IA9711 

4U<£/01 


yzi 1 boy 


Minus 


1 17011^1 IQrWd 101 4 4/V101914 1O't107 101/ffi7 10<;/47fl-1')Rft91 lOfilMA 1*5ACR1 

ii i , yio-iiouw l i^iiiu-izi£ii l itio^/-izi4 


v!A9717 
402/0/ 


OOlOIQvl 

yzizitw 


Minus 


411CtL11CCO 
10330-1033^ 


4U2/34 


on 17 ia 
9213/ 3U 


Oli i>* 

rlUS 


10343-1003/ 


4U2/0U 


yz looby 


rtUS 


nPOOQ ncOKO 11711C-117C14 

1 ooo/y- 1 0090A 1 o/ooo-l O/Wl 


402812 


CA1A4 1 A 

60101 10 


Plus 


OCAOC OCAQ4 OCQ>Jl OCQOA 

25026-2509 1 ,25 844-25920 


402845 


9369286 


Plus 


HC(\AC< 4CAC47 4CA70Q 4C4AAQ 

1 60451 -1 60 o 1 7, 1 6078&- 1 6 \ 003 


402869 


6434643 


Minus 


40QCOA 40QOOC 

138639-139335 


402892 


8086844 


Minus 


194384-194645 


403010 


3132346 


Plus 


TQOOC 7QAC1 

78385-79052 


403149 


9799833 


Plus 


25034-25185 


403326 


B440025 


Minus 


4 morn unoo 
110959-111122 


403329 


8516120 


Plus 


9&450-9b59o 


403342 


7233487 


Minus 


42312-43750 


403349 


8569773 


Minus 


1C7D4C 4Cfl*37>l 

16781 5-168374 


403381 


9438267 


Minus 


26009-261 76 


403463 


OQ9OE0Q 

9929530 


Plus 


1 AOCO(i_ 1 AOU7G 


403655 


0/ obUa J 


Din* 

rlUS 


bobbo-bob&y 


403725 


7534031 


Plus 


CC7*J7 QCQA1 

O0/0/-OD040 


403728 


7KQ/1004 

7334291 


Minus 


04401-040/1 


>iaoqa7 
40300/ 


0439933 


Minus 


1 C90C1_ 1 CC771 
lb«0O-103//3 


403826 


OQ1QOAQ 

983B2D9 


nus 


■19*1107 1949^fl 
1Z I iy /-12103a 


403885 


7710403 


Minus 


30259-33324 


403943 . 


7711864 


rlUS 


iPtmA 1 } MM\Qt\A mi999 1A1«;n9 
1 l/U /'*£.- 1 UUyU*}, 1 U 1 V 1 0 Uo 


403988 


8576087 


Plus ' 


4COC4 1C/ICO 

■ 13231-16462 


404029 


7671252 


Plus 


1AQ71fi_114119 
lUO/10-lllllt 


404058 


35487B5 


Plus 


Q01A7 1 A1 DAD 

yyoyz-iuiouu 


404069 


3168619 


Plus 


>I711A A7ARft 

4/01U-4/43U 


404178 


7630978 


Minus 


l/oO/O-l/oOoO 


404204 


3169112 


Minus 


7AQCQ QAIOI 

79o6o-00321 


404231 


mj art off 

8218035 


Minus 


C4A77 CIIOO 

ol0//-bi022 


404250 


9187145 


Minus 


OCAOa ICOIO 17DOO 0QA7C 

oouyy-obzi 2,3/320-000/3 


404273 


9885189 


Plus 


A770Q QQOQC QQCA4 QAQCC 

97789-98283,39501 -99855 


4U4040 


QfllQAQI 

yoJouyj 


r JUS 




404349  7630858  Minus   61006-61187
404391  3135305  Minus   26030-26173,27852-27997
404392  3135305  Minus   29738-29857
404397  9558608  Minus   104042-104232
404438  6984205  Plus    63413-63553
404439  7139680  Plus    55316-55585
404530  6479107  Plus    3157-3304
404596  9958262  Minus   104807-105043
404649  9796926  Minus   100027-100399
404666  7272179  Minus   18677-18993
404687  9797554  Minus   128456-126565
404744  9167237  Plus    71776-71852,72885-73019,73700-73822,74692-74850
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404751 




Pius 


404769 


RftGQ7'IO_ 


Mnus 


404790 


fZ3U9£>0 


Plus 


40490B 


700.1>ie0 


Minus 


404956 


7387343 


Plus 


404977 


07OOO;M 


Minus 


405000 


- 6957544 


Minus 


405022 


?oononii 
r33U3Q4 


Phis 


405053 


7CClri/M 

iboi 944 


Minus 


405058 


/O03OO0 


Pius 


405155 


□QfiftOOfl 

yyoozzo 


Plus 


405163 


99oozo7 


Minus 


405165 


ooccono 
990030^ 


Plus 




ooccorvo 
ybbo3Uz 


Pius 


405172 


9900/02 


Plus 




74 OQ7JIO 

7139743 


Plus 


405193 


7OO/W70 


Bus 


405194 


/4JUU/Z 


Plus 


405245 


7^49293 


Minus 


405365 


007C1IV} 

ZZral9z 


Minus 


405376 


1 CC1MH 

1552533 


Plus 


405418 


GQ07000 


Plus 




740B068 


Minus 


405443 


/4Uo 143 


Plus 


405474 


0/1*50701 

tw39/oi 


Plus 


405502 


Gin 44 1 


Minus 




AHA 0OC7 


Plus 




f 1 b?4b*0 


Plus 


405630 


yicooi ic 
tfOUOl 10 


Minus 




4»14jbU 


Plus 




ICC 

003 1 Zbo 


Minus 




707VIQCH 

72/4891 


Minus 


405812 


>i77Gcon 
4//003U 


Minus 




70COnO4 

7bb2031 


Minus 




075B747 


Minus 




6758795 


Plus 


405902 




Minus 


405932 


7767812 


Minus 


406038 


8389537 


Plus 


406140 


9168231 


Minus 


406181 


5923650 


Plus 


406231 


7417725 


Plus 


406248 


7417725 


Plus 


406274 


7543787 


Plus 


406356 


7107907 


Plus 


406404 


9256305 


Minus 


406467 


9795551 


Plus 


406557 


7711569 


Minus 


406575 


7711679 


Plus 



113799-114252.114393-114715 

175801-176823 

38611-38761 

100985-101126 

55883-56203 

43081-43229 



217163-217439 

157134-157430 

150740-151556 

130469-130723 

161171-161299 

6461-6845 

4052640891 

153027-153262 

65438^5740 

128187-128383 

190465-190645,193346-193610 
57560-58312 

119B67-120372,120481-120824,121029-121357 

28875-29099 

51B39-51953 

55716-55979 

90716-90887,101420-101577 

172005-172175 

50360-50584 

169232-169647 

47585-47688 

10^16^103291,105858-105993,110051-110126 

741-969 

66203-66832 

224961-226780 

29424-29764 

60377-60795 

54789-55457 

4m2-4m 

82322-83110 

123525-123713 

37764-37877 

49887-50219 

16586-16855 

17206-17641,17772.17968 

49711-50227 

932-1123 

18761-18973 

152569-152874 

182212-182958 

5446-5574,6170-6352 

142034-142473 
TABLE 10A: Genes preferentially expressed in non-invasive bladder tumors

Pkey: Unique Eos probeset identifier number

ExAccn: Exemplar Accession number, Genbank accession number

UnigeneID: Unigene number

Unigene Title: Unigene gene title

R1: 80th percentile of Ta tumor AIs divided by the 80th percentile of T2-T4 tumor AIs



Pkey    ExAccn     UnigeneID  Unigene Title                             R1
421110  AJ250717   Hs.1355    cathepsin E                               8.23
428651  AF196478   Hs.188401  annexin A10                               5.78
451668  Z43948     Hs.326444  cartilage acidic protein 1                5.53
415511  AI732617   Hs.182362  ESTs                                      4.72
426336  AA503115   Hs.183752  microseminoprotein, beta-                 4.66
418026  BE379727   Hs.83213   fatty acid binding protein 4, adipocyte  4.62
400752                        NM_003105*:Homo sapiens sortilin-related  3.99
430315  NM_004293  Hs.239147  guanine deaminase                         3.82
403010                        C21000152:gi|6226483|sp|Q52118|YMO3_ERWS  3.56
404977                        Insulin-like growth factor 2 (somatomedi  3.54
426657  NM_015865  Hs.171731  solute carrier family 14 (urea transport  3.51
400409  AF153341              Homo sapiens winged helix/forkhead trans  3.38
400844                        NM_003105*:Homo sapiens sortilin-related  3.27
406081                        Target Exon                               3.22
417275  X63578     Hs.295449  parvalbumin                               3.03
402230                        Target Exon                               2.96
454219  X75042     Hs.44313   v-rel avian reticuloendotheliosis viral   2.89
403381                        ENSP00000231844*:Ecotropic virus Integra  2.87
426088  AF038007   Hs.166196  ATPase, Class I, type 8B, member 1        2.86
452286  AI358570   Hs.123933  ESTs, Weakly similar to ZN91_HUMAN ZINC   2.69
434061  AW024973   Hs.283675  NPD009 protein                            2.66
418406  X73501     Hs.84905   cytokeratin 20                            2.65
418818  AA228899   Hs.101307  Homo sapiens HUT11 protein mRNA, partial  2.59
421594  R45689     Hs.21889   Homo sapiens cDNA FLJ12978 fis, clone NT  2.57
403383                        Target Exon                               2.56
435563  AF210317   Hs.95497   solute carrier family 2 (facilitated glu  2.55
424800  AL035588   Hs.153203  MyoD family inhibitor                     2.54
404606                        Target Exon                               2.53
418205  L21715     Hs.83760   troponin I, skeletal, fast                2.53
431912  AI660552   Hs.76549   ESTs, Weakly similar to A56154 Abl subst  2.52
413786  AW613780   Hs.13500   ESTs                                      2.51
421100  AW351839   Hs.124660  Homo sapiens cDNA: FLJ21763 fis, clone C  2.50
416640  BE262478   Hs.79404   neuron-specific protein                   2.50
420729  AW964897   Hs.290825  ESTs                                      2.50
402844                        C1000118*:gi|9951913|ref|NP_062832.1| pr  2.48
401093                        C12000586*:gi|6330167|dbj|BAA86477.1| (A  2.46
417720  AA205625   Hs.208067  ESTs                                      2.45
400297  AI127076   Hs.306201  hypothetical protein DKFZp564O1278        2.45
403818                        Target Exon                               2.44
440273  AI805392   Hs.325335  Homo sapiens cDNA: FLJ23523 fis, clone L  2.44
418060  AA211589   Hs.208047  ESTs                                      2.40
400843                        NM_003105*:Homo sapiens sortilin-related  2.38
446006  NM_004403  Hs.13530   deafness, autosomal dominant 5            2.35
401512                        NM_014080:Homo sapiens dual oxidase-like  2.34
446847  T51454     Hs.82845   Homo sapiens cDNA: FLJ21930 fis, clone H  2.32
417094  NM_006895  Hs.81182   histamine N-methyltransferase             2.31
436293  AI601188   Hs.120910  ESTs                                      2.30
436246  AW450963   Hs.119991  ESTs                                      2.30
447578  AA912347   Hs.136585  ESTs, Weakly similar to JC5314 CDC28/cdc  2.29
417381  AF164142   Hs.82042   solute carrier family 23 (nucleobase tra  2.28
426028  NM_001110  Hs.172028  a disintegrin and metalloproteinase doma  2.27
431448  AL137517   Hs.306201  hypothetical protein DKFZp564O1278        2.26
437181  AI306615   Hs.125343  ESTs, Weakly similar to KIAA0758 protein  2.23
415025  AW207091   Hs.72307   ESTs                                      2.18
412610  X90908     Hs.74126   fatty acid binding protein 6, ileal (gas  2.04
424099  AF071202   Hs.139336  ATP-binding cassette, sub-family C (CFTR  2.03
433078  AW015188   Hs.121575  Homo sapiens cDNA FLJ12231 fis, clone MA  2.01
416225  AA577730   Hs.188684  ESTs, Weakly similar to PC4259 ferritin   2.00
411880  AW872477              gb:hm30f03.x1 NCI_CGAP_Thy4 Homo sapiens  1.99
452316  AA298484   Hs.61265   ESTs, Moderately similar to G786_HUMAN P  1.89
413804  T64682                gb:yc48b02.r1 Stratagene liver (937224)   1.88
432306  Y18207     Hs.303090  protein phosphatase 1, regulatory (inhib  1.76
405364                        ENSP00000239138*:Guanine nucleotide-bind  1.60
414320  U13616     Hs.75893   ankyrin 3, node of Ranvier (ankyrin G)    1.52
401929                        C17001690:gi|6005701|ref|NP_009099.1|AT   1.00
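
The R1 column of Table 10A is defined as a ratio of two 80th percentiles. The following is a minimal Python sketch of that calculation, not the method actually used to generate the table. It assumes that the AI values are per-sample intensity values for a single probeset and that percentiles are interpolated linearly between ranks; this section specifies neither. The function names and sample intensities are hypothetical.

```python
def percentile(values, q):
    """q-th percentile (0-100) with linear interpolation between ranks.

    The interpolation rule is an assumption; the table legend does not
    say how the 80th percentile was computed.
    """
    xs = sorted(values)
    if not xs:
        raise ValueError("percentile of an empty sample is undefined")
    pos = (len(xs) - 1) * q / 100.0
    lo = int(pos)
    hi = min(lo + 1, len(xs) - 1)
    return xs[lo] + (xs[hi] - xs[lo]) * (pos - lo)


def r1_ratio(ta_ais, t2_t4_ais, q=80.0):
    """Ratio of the q-th percentile of Ta values to that of T2-T4 values."""
    denominator = percentile(t2_t4_ais, q)
    if denominator <= 0:
        raise ValueError("denominator percentile must be positive")
    return percentile(ta_ais, q) / denominator


# Made-up intensities for one hypothetical probeset.
ta = [120.0, 340.0, 560.0, 410.0, 980.0]
invasive = [60.0, 75.0, 90.0, 110.0, 130.0]
print(round(r1_ratio(ta, invasive), 2))  # 644 / 114, prints 5.65
```

Swapping the two arguments gives the inverse ratio, which is how a table of genes preferentially expressed in the T2-T4 group would be ranked.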
Pkey: Unique Eos probeset identifier number
CAT number: Gene cluster number
Accession: Genbank accession numbers



Pkey    CAT Number  Accession

411880  1263110_1   AW872477 BE088101 T05990

413804  1390710_1   T64682 BE168190 BE158256

TABLE 10C



Pkey: Unique number corresponding to an Eos probeset

Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (GI) numbers. "Dunham I. et al." refers to the publication entitled "The DNA sequence of human chromosome 22." Dunham I. et al., Nature (1999) 402:489-495.
Strand: Indicates DNA strand from which exons were predicted.
Nt_position: Indicates nucleotide positions of predicted exons.



Pkey    Ref      Strand  Nt_position
400752  7331445  Minus   36215-36461
400843  9188605  Plus    5863-5970,7653-7784,8892-9023,9673-9807,10634-10789,15254-15403,23827-23958
400844  9188605  Plus    24746-24872,25035-25204
401093  8516137  Minus   22335-23166
401512  7622346  Plus    136399-136557
401929  3810670  Minus   3167-3286,4216-4310
402230  9986312  Minus   29782-29932
402844  9369286  Plus    54958-553f3
403010  3132346  Plus    78385-79052
403381  9438267  Minus   26009-26178
403383  9438267  Minus   119837-121197
403818  8962065  Minus   138360-138512,144656-144796
404606  9212936  Minus   22310-23269
404977  3738341  Minus   43081-43229
405364  2281075  Minus   48325-48491,49136-49252
406081  9123861  Minus   38115-38691
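
Each Nt_position entry in Table 10C is a comma-separated list of start-end nucleotide ranges, one per predicted exon. As an illustration only, the following Python sketch parses such an entry and sums the exon lengths. It assumes that both coordinates are inclusive, which this section does not state; the function names are hypothetical.

```python
def parse_nt_position(text):
    """Parse 'start-end,start-end,...' into a list of (start, end) tuples."""
    exons = []
    for part in text.split(","):
        start_text, end_text = part.strip().split("-")
        start, end = int(start_text), int(end_text)
        if end < start:
            raise ValueError(f"range end precedes start: {part.strip()!r}")
        exons.append((start, end))
    return exons


def total_exon_length(exons):
    """Total length in nucleotides, assuming inclusive coordinates."""
    return sum(end - start + 1 for start, end in exons)


# Entry for Pkey 400844 in Table 10C.
exons = parse_nt_position("24746-24872,25035-25204")
print(exons)                     # [(24746, 24872), (25035, 25204)]
print(total_exon_length(exons))  # 127 + 170 = 297
```

Entries that still carry extraction damage, such as a letter inside a coordinate, raise ValueError on integer conversion rather than being silently accepted.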
TABLE 11A: Genes preferentially expressed in muscle-invasive bladder tumors

Pkey: Unique Eos probeset identifier number
ExAccn: Exemplar Accession number, Genbank accession number
UnigeneID: Unigene number
Unigene Title: Unigene gene title
R1: 80th percentile of T2-T4 tumor AIs divided by the 80th percentile of Ta tumor AIs



Pkey 

423961 

421948 

401780 

446619 

444381 



408243 
414183 
411573 
414522 
413063 
441633 
427337 
420859 
422168 
418870 
401781 
421116 
448429 
414020 
447526 
424247 
456525 
418007 
406663 



433336 
425118 
415994 
412326 
422158 
446921 
433470 
417880 
446500 
413324 
436729 
450455 
413731 
412429 
418283 
418299 
420899 



437446 
413441 
404854 
431319 
452432 
429679 
428330 
408380 
431103 
422545 
418203 



406687 
414359 
417259 
417497 
449335 
445033 
427274 
427527 
409142 
453331 
428036 
417365 



ExAccn 
D13666
L42583

AU076643
BE387335
AW014875
Y00787
AW957446
AB029000
AW518944
AL035737
AW958544
Z46223
AW458397
AA586894
AF147204

T19132
D17408
NM_002984
AL048753
X14008
AW468397
M13509
U24683
AA278921
AF017986
AU076611
NM_002923
R07566
L10343
AB012113
AW960564
BE241595
U78093
V00571
BE621807
AL117424
BE243845
AV650262
S79895
AA279530
NM_001629
X06256
AA788946
AI929374

AA873350
AW206008
NM_006290
L22524
AF123050
M57399
X02761
X54942
AW103364
M31126
M62194
AW903838
AW402482
AW150717
AV652402
NM_005211
AI809057
AL136877
AI240665
AW068302
BE185289



UnigenelD 
Hs.136348
Hs.334309

Hs.313
Hs.283713
Hs.137007
Hs.624
Hs.301711
Hs.70823
Hs.76325
Hs.75184
Hs.112242
Hs.176663
Hs.100000
Hs.112408
Hs.89414

Hs.101850
Hs.21223
Hs.75703
Hs.303649
Hs.234734
Hs.100000
Hs.83169
Hs.293441
Hs.1908
Hs.31386
Hs.154672
Hs.78944
Hs.73817
Hs.112341
Hs.16530

Hs.82848 
Hs.15154 
Hs.75294 

Hs.25035 
Hs.75511 
Hs.75765 
Hs.83942 
Hs.83968 
Hs.100194 
Hs.149609 
Hs.101302 
Hs.75367 

Hs.302232 
Hs.283378 
Hs.211600 
Hs.2256 
Hs.44532 
Hs.44 
Hs.287820 
Hs.83758 
Hs.727 

Hs.75929 
Hs.81800 
Hs.82212 
Hs.345728 
Hs.72901 
Hs.174142 
Hs.293441 
Hs.50758 



Hs.1076 



Unigene title 
periostin (OSF-2os)
keratin 6A

NM_005557*:Homo sapiens keratin 16 (foca
secreted phosphoprotein 1 (osteopontin,
ESTs, Weakly similar to S64054 hypotheti
ESTs

interleukin 8
ESTs

KIAA1077 protein
step II splicing factor SLU7
chitinase 3-like 1 (cartilage glycoprote
normal mucosa of esophagus specific 1
Fc fragment of IgG, low affinity IIIb, r
S100 calcium-binding protein A8 (calgran
S100 calcium-binding protein A7 (psorias
chemokine (C-X-C motif), receptor 4 (fus
Target Exon

retinol-binding protein 1, cellular
calponin 1, basic, smooth muscle
small inducible cytokine A4 (homologous
small inducible cytokine A2 (monocyte ch
lysozyme (renal amyloidosis)
S100 calcium-binding protein A8 (calgran
matrix metalloproteinase 1 (interstitial
Immunoglobulin heavy constant mu
proteoglycan 1, secretory granule
secreted frizzled-related protein 2
methylene tetrahydrofolate dehydrogenase
regulator of G-protein signalling 2, 24k
small inducible cytokine A3 (homologous
protease inhibitor 3, skin-derived (SKAL
small inducible cytokine subfamily A (Cy
transmembrane 4 superfamily member 1
selectin L (lymphocyte adhesion molecule
sushi-repeat-containing protein, X chrom
corticotropin releasing hormone
transmembrane 4 superfamily member 1
chloride intracellular channel 4
connective tissue growth factor
GRO2 oncogene
cathepsin K (pycnodysostosis)
integrin, beta 2 (antigen CD18 (p95), ly
arachidonate 5-lipoxygenase-activating p
integrin, alpha 5 (fibronectin receptor,
ESTs, Moderately similar to CA1C_RAT COL
Src-like-adapter
Target Exon
ESTs

Homo sapiens cDNA: FLJ21778 fis, clone H
tumor necrosis factor, alpha-induced pro
matrix metalloproteinase 7 (matrilysin,
diubiquitin

pleiotrophin (heparin binding growth fac

fibronectin 1

CDC28 protein kinase 2

inhibin, beta A (activin A, activin AB a

matrix metalloproteinase 11 (stromelysin

cadherin 11, type 2, OB-cadherin (osteob

chondroitin sulfate proteoglycan 2 (vers

CD53 antigen

STAT induced STAT inhibitor 3
mucin 13, epithelial transmembrane
colony stimulating factor 1 receptor, fo
Immunoglobulin heavy constant mu
SMC4 (structural maintenance of chromoso
ESTs

Homo sapiens mRNA for caldesmon, 3' UTR
small proline-rich protein 1B (cornifin)



R1 

11.22 

10.87 

9.16 

8.40 

7.88 

7.73 

7.54 

7.00 

6.52 

6.42 

6.14 

6.04 

5.56 

5.62 

5.51 

5.48 

5.46 

5.41 

5.41 

5.32 

5.32 

5.27 

5.22 

5.17 

5.08 

4.93 

4.89 

4.74 

4.64 

4.39 

4.30 

4.29 

4.23 

4.22 

4.22 

4.20 

4.18 

4.15 

4.09 

4.00 

4.00 

4.00 

3.97 

3.95 

3.94 

3.91 

3.81 

3.77 

3.76 

3.72 

3.58 

3.58 

3.57 

3.52 

3.49 

3.48 

3.41 

3.36 

3.32 

3.30 

3.25 

3.23 

3.23 

3.22 

3.18 

3.15 

3.10 

3.09 
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414622 


AI752666 


Hs.76669 


418478 


U38945 


Hs.1174 


417771 


AA804658 


Hs.82547 


413936 


AF113676 


Hs.297681 


406755 


N80129 


Hs.94360 


426653 


AA530892 


Hs.171695 


443623 


M345519 


Hs.9641 


443907 


AU076484 


Hs.9963 


422048 


NM_012445 


Hs.288126 


410204 


AJ243425 


Hs.326035 


438973 


AW959503 


Hs.60440 


420202 


AL036557 


Hs.95910 


422626 


AA344932 


Hs.118786 


442402 


NM_000954 


Hs.8272 


413902 


AU076743 


Hs.75613 


434858 


R50032 


Hs.159263 


407207 


T03651 


Hs.336780 


438855 


AW946276 


Hs.6441 


430413 


AW842182 


Hs.241392 


424909 


S78187 


Hs.153752 


419938 


AU076772 


Hs.1279 


416819 


U77735 


Hs.80205 


422562 


AI962060 


Hs.118397 


414081 


AW969976 


Hs.279009 


426406 


AI742501 


Hs.169756 


443950 


NM_001425 


Hs.9999 


418323 


NM_002118 


Hs.1162 


414420 


AA043424 


Hs.76095 


415149 


X12451 


Hs.78056 


415213 


NM_002933 


Hs.78224 


421848 


X15880 


Hs.108885 


452516 


AA058630 


Hs.29759 


427676 


AA394062 


Hs.300772 


415198 


AW009480 


Hs.943 


424390 


AW815657 


Hs.182241 


426825 


AL133415 


Hs.297753 


452363 


AI582743 


Hs.94953 


407694 


U77594 


Hs.37682 



nicotinamide N-methyltransferase 
cyclin-dependent kinase inhibitor 2A (me 
retinoic acid receptor responder (tazaro 
serine (or cysteine) proteinase inhibito 
metallothionein 1L 
dual specificity phosphatase 1 
complement component 1, q subcomponent, 
TYRO protein tyrosine kinase binding pro 
spondin 2, extracellular matrix protein 
early growth response 1 
ESTs, Weakly similar to serin protease w 
putative lymphocyte G0/G1 switch gene 
metallothionein 2A 

prostaglandin D2 synthase (21kD, brain) 

CD36 antigen (collagen type I receptor, 

collagen, type VI, alpha 2 

tubulin, beta polypeptide 

Homo sapiens mRNA; cDNA DKFZp586J021 (fr 

small inducible cytokine A5 (RANTES) 

cell division cycle 25B 

complement component 1, r subcomponent 

pim-2 oncogene 

AE-binding protein 1 

matrix Gla protein 

complement component 1, s subcomponent 

epithelial membrane protein 3 

major histocompatibility complex, class 

immediate early response 3 

cathepsin L 

ribonuclease, RNase A family, 1 (pancrea 
collagen, type VI, alpha 1 

RNA POLYMERASE I AND TRANSCRIPT RELEASE 

tropomyosin 2 (beta) 

natural killer cell transcript 4 

interferon induced transmembrane protein 



Homo sapiens, Similar to complement comp 
retinoic acid receptor responder (tazaro 



3.07 
3.02 
2.77 
2.75 
2.75 
2.67 
2.65 
2.64 
2.51 
2.46 
2.46 
2.44 

2.44 

2.43 
2.42 
2.42 
2.30 
2.29 
2.20 
2.18 
2.17 
2.11 
2.07 
2.07 
2.03 
2.01 
1.94 
1.90 
1.72 
1.70 
1.69 
1.64 
1.64 
1.60 
1.59 
1.51 
1.46 
1.44 



TABLE 11P 



Pkey: Unique Eos probeset identifier number 
CAT number: Gene cluster number 
Accession: Genbank accession numbers 



Pkey CAT Number Accession 

428036 28620.1 AW068302 AI754558 AI750727 A1762631 AA302174 AA327522 M641 10 AW859944 AW859989 AJ751995 AA769620 AI858829 AI924875 

. AI888836AA864291 A1685060 AW088029A1 924908 AW466328AI093800AA99 1651 AI254501 BE004703 AA334442AW938852AA194330 
AL046953 AA852866 AW391995 W30846 AW662928 W25261 AA042863 R99045 H97060 W03910 H94687 T88984 AL048165 T29632 
N31556 N36484 AI798679 AA989355 W23832 AA873789 A1743646 AA363587 A1814748 AW338990 N73740 N83666 AL047816 R24137 
R63433 AA524984 AA234043 AA195131 N99903 AA453669 A1240302 AA370271 A1950026 AW771049 AA1 21476 AA569557 AI752632 
AI355594 AI471 993 Al 1 59941 N94555 AI753138 N21537 H97881 N25769 AW068044 AA808425 R63380 AA384736AA384738AA852352 
AI073645 AA527950 AA525036 AA044414 AI752460 AA703054 RD121 6 AA897183 A1751996 T81078 H95047 AA573642 D58348 N20953 
AA437143 N95439 AA579540AW867056AA770090A10B51 80 Al 806799 AA426421 AI572513 R24081 AA853189 AA295620AA234044 
AA371020 AW994984 H20896 AW964438 AA318516 AA318499 AA318727 AA318211 AA318478 AA318444 AA31B307 AA318497 AA318448 
AA318309 AA318496 AA318213 AA318435 AA318424 AA318217 AA318523 AA318438 AA318487 AA318724 AA593165 AW994985 T69842 
A1251813 AA478174 AA447737 T68350 F07712 AA121 145 H08973 AA345212 BE000667 AW068210 AW608407 R05674 H16712 N85426 
N42354 H85516 BE147991 T281 1 3 R32662 AA384678 AW239275 H82382 AW840700 D58229 C04082 W45394 AW795667 R73973 
BE002409 AA042828 AA363555 AJ223812 AA344709 BE149590 R70995 W46881 W90778 N71242 AA534826 ALO40676 R23797 H96450 
AA062957 D79947 W46960 AW959278 AA295997 AA026215 AW579469 AW365135 AW365134 AW994353 AW972886 AW069166 
AA343690 AW888731 AJ751527 AA937490 AA937505 AI826715 BE465604 AI925532 A1858109 AW339097 AI858524 A1720571 BE046506 
AW384981 AA043908 AA375983 AA525181 AW068366AW070577 AW891837 N83985 AW1 82753 AI422979 AI679733 BE006555 AL048166 
A1081401 AI888821 AI626043 N37087 A1S24140 A1801 298 AA600048 A1753947 H89615 N66424 AW069756 A181 4880 A1982806 A1754287 
AJ971 816 AW069022 AW069069 AW069454 AA342989 AI07771 2 A1311467 AI087361 AI801015 W46993 AJ281324 AW191963 AI421675 
AI300881 A1356670 AA873156 A100421 9 Al 189685 AA478018 AA076063 AI445222 AI753124 A1521 569 AI925025 AI022358 AI475993 
H20846 AI223234 AI635123 AA579170 N30442 AW1 17889 AA807935 AA558975 A1306636 AA888963 AI952591 A1935835 AJ445293 H16713 
AW139833AA622122 AA972051 AJ280828 H09543 AI4S3725 AW069613 A1865615 AJ753921 AI368782 AJ633208 AI446651 W46961 N22201 
H82276 C16555 AA291477 AW440535 AW517755 AA669921 A1926777 AW6621 18 AA553369 N67B73 AW023948 C15861 C16601 AI251465 
AW079187 BE045090 AI273006 C16390 C16503 AI620823 F13661 N66864 Z2131 1 C16108 C16089 C16400 AA758273 A1287781 AA864676 
AW608074 AW385583 AI589944 AA665817 AW192979 AW469065 AA564048 H84715 C16417 AA731072 AA661574 C16487 N29477 
AW189997 A1370492 C16471 AA652809 AA936687 AA506512 C16306 AW028413 A1537935 AA528347 C16255 AW029046 C16202 
AI868152 A1524662 T94414 AI567041 AI619654 AW008486 A1075624 AA577434 AA345104 T30105 AA932002 C16585 A1750390 AW294265 
A1619552 AA669781 AA026678 A W1 32002 AW263919 C16562 AA759137 AA693351 Z40779 C16577 AA885045 AW073763 R45484 
AI520895 U54708 T49285 AI5681 26 AW006569 AI093317 AL119781 T61046 A1053563 H51958 AF114144 AA305739 AW950394 AW793928 
AW793910 AL047737 AV659047 AV659632 AI750369 AA092053 AA092798 H85367 T61597 R23745 Z20418 T78485 A1751528 AW068121 
AA853188 A1752459 AA85371 1 AW950663 R78964 R36359 R21626 R21 522 






433470 3672.1 



436729 42585.1 



453331 96214.1 



AW960564 AA092457 T55890 D56120T92525 AI815987 BE182608 BE182595 AW080238 M90657 AA347236 AW961686 AW176446 
AA304671 AW583735 T61714 AA316968 A1446615 AA343532 AA0B3489 AA488005 W52095 W39480 N57402 D82638 W25540 W52847 
082729 D58990 BE619182 M3151B8 AA308636 AA1 12474 W761 62 AA088544 H52265 AA301 631 H80982 AA1 13786 BE620997 AW651691 
AA343799 BE613669 BE5471 80 BE546656 F1 1933 AA376800 AW239185 AA376086 BE544387 BE619041 AA45251 5 AA001806 AA190873 
AA180483 AA159546 F00242 AI940609 A1940602 AI189753 T97663 T661 10 AW062896 AW062910 AW062902 AI051622 AI828930 
AA102452 A1685095 A1819390 AA557597 AA383220 AI804422 AI633575 AW338H7 AW603423 AW606800 AW750567 AW510672 AI250777 
AA083510 AW629109 AW513200 AA921353 A1677934 AI148698 AI955858 AA173825 AA453027 AI027865 AW375542 AA454099 AA733014 
A1591384 R79300 R80023 AA843108 AA626058 AA844898 AW375550 AA889018 AI474275 AW205937 AI052270 AW388117 AW388111 
AA699452 AI242230 N47478 H38178 AA366621 AA 113196 AA130023 H39740 T61629 A1885973 AW083671 AA179730 AA305757 AI285455 
N83956 AA216013 AA336155 AW999959 T97525 AA345349 T91 762 AA771981 A1285092 AI591386 BE392486 BE385852 AA682601 
AI682884 AA345840 T85477 AA292949AA932079AA098791 D82607 T48574 AW752038 C06300 

BE621807 AI445461 A1346835 AI453743 AI564644 AI928364 AW984527 BE156214 AI694111 AI591358 C17504 C17476 C17963 C18304 
AW071625 A1678712 C17732 D57559 H61762 AI720939 AI262930 H27252 AA47971 2 AI927769 AA291465 AA155661 AI963432 A1567995 
AA421678 AI925607 AA292956 AA192448 AW192593 AI865838 A169690S AJ871950 AJ911921 BE619741 BE439796AI161312 AI597B01 
AI424384 AI093510 AI240988 AW820230 AI492554 BE044033 AW262737 AW008570 AA043216 AW629505 AA136645 AA037722 AA706057 
AA088439 AW806193 AW806183 AM79834 BE501957 AA129574 R3B1 14 AA649494 AA524526 BE3271 20 AW572531 BE219784 BE349186 
AW015724 AA043217 AW772000 Al 79981 4 Al 67 1727 AW779725 AA502832 A1470033 AA129575 W38161 AI972739 AA404570 AA627686 
AA723200 AA1 47228 AA903050 AI990245 AI075878 T32487 C06123 AA1 57944 AI800106 W60075 AI859160 AA478328 AW673152 
AA182640 AI990827 AW275048 AW103470 AI298935 AW471421 R79190 AW085158 W45410 AI333170 AW300456 AA662517 J55840 
AI823466A1692846M962397AW191997M136658AI251817BE044134AVV339104AW517762M724739R799^ 
AA037696 AA190966 AA757735 AW772283 AA010631 H80983 AI769516 H64985 AI061065 AI950693 AA085492 A1245632 H28594 
AW0B8968 BE156360 AI349390 A1621320 AI738844 AW194272 AA148284 AA953883 C06365 AA487893 AI927 '217^918523 AI453453 
A79K02A11B9366AI261359A^ 

AA581093 AI650338 W60032 M603586 AI686240 AW242958 AA719173 AI745717 AW675302 AI582462 AI244845 A^65439 F0^9 

SioS^ 

A1348877 AI798415 D1 1921 A1377596 AI983655 AI744233 C061 1 1 AI248307 AA94B565 AI224807 

A ^B5T53681 N77468 H51833 AA147247 R75732 C18450 R73999 A1095755 T49904 H03868 AA411580 R33395 AA410586 T4B869 

AI37M20T48870 R73906 R75632 H03612 AA909684 N5069S H02580 H12839 N58781 AA742532 AITOIfl ^H03502 8E208298 R68588 
AI350463 R31935 AW069127 AA41 1621 R25671 R36105 H12451 H03869 H51263 AA035486 R25109 R251 10 AA147933 



TABLE 11C 



sequence of human chromosome 22." Dunham I. et al., Nature (1999) 402:489-495. 
Strand: Indicates DNA strand from which exons were predicted. 
Nt_position: Indicates nucleotide positions of predicted exons. 



40™ 7249190 ^8617,28920-29045,291 3W9296.2941 1-29567.29705-29787.30224-30573 

401781 7249190 Minus 83215^35,83531-83656,83740-83901,84237-84393,8495^85037,86290-86814 
404854 7143420 Plus 14260-14537 
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10 







Pkey 
424503 
400289 
418007 
418738 
406964 
429278 
428227 
410102 
425247 
424735 
420159 
415511 
452461 
413324 
443211 
439926 
432222 



443171 
432829 
410553 
419183 
425721 
420370 
437852 



ExAccn 
NM_002205 
X07820 
M13509 
AW388633 
M21305 
AF056085 
AA321649 
AW248508 
NM_005940 
U31875 
AI572490 
AI732617 
N78223 
V00571 
Ai\28388 
AW014875 
AI204995 
BE296227 
BE281128 
W60377 
AW016824 



456034 
421110 
451668 
408243 
440304 
414918 
418867 



405033 



AC002115 

Y13645 

BE001836 

AK001666 

AW450979 

AJ250717 

Z43948 

Y00787 

BE159984 

AI219207 

D31771 

AF038007 



UnigeneID 

Hs.149609 

Hs.2258 

Hs.83169 

Hs.B682 

Hs.198612 

Hs.2248 

Hs.279727 

Hs.155324 

Hs.272499 

Hs.99785 

Hs.182362 

Hs.108106 

Hs.75294 

Hs.143655 

Hs.137007 

Hs.250822 

Hs.9030 

Hs.57772 

Hs.272068 

Hs.89663 

Hs.159309 

Hs.97234 

Hs.256897 

Hs.189095 

Hs.1355 

Hs.326444 

Hs.624 

Hs.125395 

Hs.72222 

Hs.89404 

Hs.166196 



422282 


AF019225 


Hs.114309 


425852 


AK001504 


Hs.159651 


400844 






404875 






422809 


AK001379 


Hs.121028 


431347 


AI133461 


Hs.251664 


413804 


T64682 




444163 


AI126098 




444444 


AM9332 


Hs.14855 


427747 


AW411425 


Hs.180655 


419741 


NM_007019 


Hs.93002 


420281 


AI623693 


Hs.323494 


402305 
436608 






427239 


AA628980 
BE270447 




413278 


BE563085 


Hs.833 


424008 


R02740 


Hs.137555 


414907 


X90725 


Hs.77597 


421379 


Y15221 


Hs.103982 


426028 


mjoomo 


Hs.172028 


417079 


U65590 


Hs.81134 


401093 






422247 


U18244 


Hs.113602 


449722 


BE280074 


Hs.23960 


423673 


BE003054 


Hs.1695 


418478 


U38945 


Hs.1174 


439606 


W79123 


Hs.58561 


418543 


NM_005329 


Hs.85982 


418678 


NM_001327 


Hs.167379 


428182 


BE386042 


Hs.293317 


427335 


AA448542 


Hs.251677 


409420 


Z15008 


Hs.54451 



Unigene Title 

integrin, alpha 5 (fibronectin receptor, 
matrix metalloproteinase 10 (stromelysin 
matrix metalloproteinase 1 (interstitial 
solute carrier family 7, (cationic amino 
FGENES predicted novel secreted protein 
G protein-coupled receptor 51 
small inducible cytokine subfamily B (Cy 
ESTs; homologue of PEM-3 [Ciona savignyi 
matrix metalloproteinase 11 (stromelysin 
short-chain alcohol dehydrogenase family 
Homo sapiens cDNA: FLJ21245 fis, clone C 
ESTs 

transcription factor 
corticotropin releasing hormone 
ESTs 
ESTs 

gb:an03c03.x1 Stratagene schizo brain S1 

serine/threonine kinase 15 

TONDU 

ESTs 

hypothetical protein MGC14128 
cytochrome P450, subfamily XXIV (vitamin 
uroplakin 1A 
uroplakin 2 

ESTs, Weakly similar to dJ365012.1 [H.sa 
similar to SALL1 (sal (Drosophila)-like 
gb:UI-H-BI3-ala-a-12-0-UI.s1 NCI_CGAP_Su 
cathepsin E 

cartilage acidic protein 1 

interleukin 8 

ESTs 

hypothetical protein FLJ13459 

msh (Drosophila) homeo box homolog 2 

ATPase, Class I, type 8B, member 1 

C1002652*:gi|544327|sp|Q04799|FMO5_RABIT 
apolipoprotein L 

death receptor 6, TNF superfamily member 
NM_003105*:Homo sapiens sortilin-related 
NM_022819*:Homo sapiens phospholipase A2 
hypothetical protein FLJ10549 
insulin-like growth factor 2 (somatomedi 
gb:yc48b02.r1 Stratagene liver (937224) 
FGENESH predicted RNaseH domain-containi 
ESTs 

serine/threonine kinase 12 
ubiquitin carrier protein E2-C 
Predicted cation efflux pump 
C19000735*:gi|4508027|ref|NP_003414.1| z 
down syndrome critical region protein DS 
ubiquitin carrier protein 
interferon-stimulated protein, 15 kDa 
putative chemokine receptor; GTP-binding 
polo (Drosophila)-like kinase 
small inducible cytokine subfamily B (Cy 
a disintegrin and metalloproteinase doma 
interleukin 1 receptor antagonist 
C1^00586*:gi(6330167|dbpAA86477.1| (A 
solute carrier family 1 (high affinity a 
cyclin B1 

matrix metalloproteinase 12 (macrophage 

cyclin-dependent kinase inhibitor 2A (me 

G protein-coupled receptor 87 

hyaluronan synthase 3 

cancer/testis antigen (NY-ESO-1) 

ESTs, Weakly similar to GGC1_HUMAN G ANT 

G antigen 7B 

laminin, gamma 2 (nicein (100kD), kalini 
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Seq ID No. 
Seq ID No. 1 & 2 
Seq ID No. 3 & 4 
Seq ID No. 5 & 6 
Seq ID No. 7 & 8 
Seq ID No. 9 & 10 
Seq ID No. 11 & 12 
Seq ID No. 13 & 14 
Seq ID No. 15 & 16 
Seq ID No. 17 & 18 
Seq ID No. 19 & 20 
Seq ID No. 21 & 22 
Seq ID No. 23-25 
Seq ID No. 26 & 27 
Seq ID No. 28 & 29 
Seq ID No. 30 
Seq ID No. 31 & 32 
Seq ID No. 33 
Seq ID No. 34 & 35 
Seq ID No. 36 & 37 
Seq ID No. 38 & 39 
Seq ID No. 40 & 41 
Seq ID No. 42 & 43 
Seq ID No. 44 & 45 
Seq ID No. 46 & 47 
Seq ID No. 48 & 49 
Seq ID No. 50 & 51 
Seq ID No. 52 
Seq ID No. 53 & 54 
Seq ID No. 55-60 
Seq ID No. 61 & 62 
Seq ID No. 63 & 64 
Seq ID No. 65 & 66 
Seq ID No. 67 & 68 
Seq ID No. 69 & 70 
Seq ID No. 71 & 72 
Seq ID No. 73 & 74 
Seq ID No. 75 & 76 
Seq ID No. 77 & 78 
Seq ID No. 79 & 80 
Seq ID No. 81 & 82 
Seq ID No. 83 & 84 
Seq ID No. 85 & 86 
Seq ID No. 87-89 
Seq ID No. 90 & 91 
Seq ID No. 92 & 93 
Seq ID No. 94 & 95 
Seq ID No. 96 & 97 
Seq ID No. 98-100 
Seq ID No. 101 & 102 
Seq ID No. 103 & 104 
Seq ID No. 105 & 106 
Seq ID No. 107 & 108 
Seq ID No. 109 & 110 
Seq ID No. 111 & 112 
Seq ID No. 113 & 114 
Seq ID No. 115 & 116 
Seq ID No. 117-119 
Seq ID No. 120 & 121 
Seq ID No. 122 & 123 
Seq ID No. 124 & 125 
Seq ID No. 126-133 
Seq ID No. 134 & 135 
Seq ID No. 136 & 137 
Seq ID No. 138-141 
Seq ID No. 142 & 143 
Seq ID No. 144 & 145 
Seq ID No. 146 & 147 
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404440 






433091 


Y12642 


Hs.3185 


408380 


AF123050 


Hs.44532 


409893 


AW247090 


Hs.57101 


424905 


NM_002497 


Hs.153704 


438817 


AI023799 




421478 


AI683243 


Hs.97258 


431515 


NM_012152 


Hs.258583 


433159 


AB035898 


Hs.150587 


432441 


AW292425 


Hs.163464 


425726 


AF085808 


Hs.159330 


448045 


AJ29743B 


Hs.20166 


431846 


BE019924 


Hs.271580 


437044 


AL035864 


Hs.69517 


444381 


BE387335 


Hs.283713 


400303 


AA24275B 


Hs.79136 


452747 


BE153855 


Hs.61460 


400297 


AI127076 


Hs.306201 


428484 


AF104032 


Hs.184601 


429211 


AF052693 


Hs.198249 


417389 


BE260964 


Hs.82045 


445537 


AJ245671 


Hs.12844 


423961 


D13666 


Hs.136348 


417433 


BE270266 


Hs.82128 


444781 


NM_014400 


Hs.11950 


430486 


BE062109 


Hs.241551 


425650 


NM_001944 


Hs.1925 


409103 


AF251237 


Hs.112208 


429404 


AI267700 




403047 






439738 


BE246502 


Hs.9598 



NM_021048:Homo sapiens melanoma antigen, Seq ID No. 148 & 149 

lymphocyte antigen 6 complex, locus D Seq ID No. 150 & 151 

diubiquitin Seq ID No. 152 & 153 

minichromosome maintenance deficient (S. Seq ID No. 154 & 155 

NIMA (never in mitosis gene a)-related k Seq ID No. 156 & 157 

ESTs Seq ID No. 158 

ESTs, Moderately similar to S29539 ribos Seq ID No. 159 & 160 

EDG-7 (endothelial differentiation, lys Seq ID No. 161 & 162 

kinesin-like protein 2 Seq ID No. 163 & 164 

ESTs Seq ID No. 165 

uroplakin 3 Seq ID No. 166 & 167 

prostate stem cell antigen Seq ID No. 168 & 169 

uroplakin 1B Seq ID No. 170 & 171 

differentially expressed in Fanconi's an Seq ID No. 172 & 173 

ESTs, Weakly similar to S64054 hypotheti Seq ID No. 174 & 175 

LIV-1 protein, estrogen regulated Seq ID No. 176 & 177 

Ig superfamily receptor LNIR Seq ID No. 178 & 179 

hypothetical protein DKFZp564O1278 Seq ID No. 180 & 181 

solute carrier family 7 (cationic amino Seq ID No. 182 & 183 

gap junction protein, beta 5 (connexin 3 Seq ID No. 184 & 185 

midkine (neurite growth-promoting factor Seq ID No. 186 & 187 

EGF-like-domain, multiple 6 Seq ID No. 188 & 189 

periostin (OSF-2os) Seq ID No. 190 & 191 

5T4 oncofetal trophoblast glycoprotein Seq ID No. 192 & 193 

GPI-anchored metastasis-associated protein homolog Seq ID No. 194 & 195 

chloride channel, calcium activated, family member 2 Seq ID No. 196 & 197 

desmoglein 3 (pemphigus vulgaris antigen) Seq ID No. 198 & 199 

XAGE-1 protein Seq ID No. 200 & 201 

ESTs Seq ID No. 202 

NM_005656*:Homo sapiens transmembrane protease Seq ID No. 203 & 204 

sema domain, immunoglobulin domain (Ig), transmem Seq ID No. 205 & 206 



TABLE 12B 



Pkey: Unique Eos probeset identifier number 
CAT number: Gene cluster number 
Accession: Genbank accession numbers 



Pkey CAT Number Accession 

413804 1390710J T64682 BE168190 BE168256 

427239 27647.1 BE270447 AW409921 BE207288 BE207170 D56355 BE263223 BE408171 BE262243 BE392439 BE292738 BE261776 BE314300 BE267719 

BE268715BE513876BE295291 BE297066M210923BE407519H51344BE622905AW248281 AW250313T19021 AA3551 15 AA3 16879 
BE269633 BE621936 AA290724 AA380031 AA993616 BE301331 W79220 H73020 AW797850 BE27461 1 BE314315 BE542407 BE562063 
BE312868 BE278817 BE252503 AA426470 M91670 NM_014501 BE264533 AI884863 AI884942 AI678077 AA829937 AI869333 AW273060 
AA906924 AI571293 AW172642 AA5B2906 AW674596 AW300936 AW615753 AW615587 AW565495 A1859296 AA594105 AA9281 10 
A1038859 AI589898 AA828316 AA938955 AA28371 1 AW170665 BE047759 AW32128 A10t5067 AW070748 AW248955AA975490 
AA21 1097 AI826097 AA826730 AA994072 A1367867 W74508 AI086474 AA827720 AI922589 AW000868 W58033 AA991 461 BE0481 31 
M91 21 8 AA8761 86 AA661759 All 23879 AI089508 AW27291 5 A1566244 AW245061 AA688299 AW250988 A1953468 AJ891 144 AW089131 
AI471577AA293354AA464019BE044549T29587AW956171 BE537716 BE269400 AA91B328 A1538087 AA969243 AW075033 BE139361 
A1340589 AI250128 AI247038 BE138953 AW075177 AI307208 BE049086 AW302327 AI054335 AI345565 AI334881 AI252075 AI254494 
AW075006 AW302733 AW302738 A/054057 A1054217 AI054172 AI054302 AJ053722 AI054060 AI054079 AW075181 A1307473 AI312145 
AI340734 AI334909 AW071 374 BE138502 AW074809 AW301901 AI251662 A1307559 BE139228 AI254764 AW073049 AI251264 AI802837 
AW271867 A1307442 AW075100 AW073456 AW072496 AI270787 AW271039 AW071307 AI610913 AW071289 AI251232 AI251262 
AW073656 AW072901 AI307493 A1255068 AI251289 AI252160 A1271496 A1252427 AW073469 AW071420 A1270156 A1252925 AI252839 
A1252868 AW072520 AW073433 A1340643 AI802854 AI334733 A15B3896 AW07131 1 AI802853 AI28971 1 AI345036 AW072513 AI348921 
AI307478AI305762BE139315AW271034AJ334886AJ340619AW470478BE139260BE138965AW302085BE 

432222 343347 J AI204995 AW827539 AW969908 AW440776 AA528756 

436608 42361 J AA628980 AM 26603 BE504035 

438817 465592J AI023799 AA826307 AI683094 AI307373 AI870547 AW979007 

444163 593658 1 AI126098 AI184746 AI148521 

456034 142696.1 AW450979 AA136653 AA136656 AW419381 AA984358 AA492073 BE168945 AA809054 AW238038 BE011212 BE01 1359 BE011367 
BE011368 BE011362BE011215 BE011365 BE011363 
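
As a rough illustration of how a row of this table can be read, the following sketch splits one whitespace-separated row into its Pkey, CAT number, and list of Genbank accessions. The `ProbesetRow` type and `parse_row` function are hypothetical names introduced only for this example, and the sketch assumes a row has already been joined onto a single line.

```python
from typing import List, NamedTuple


class ProbesetRow(NamedTuple):
    pkey: int              # Eos probeset identifier
    cat_number: str        # gene cluster number, kept as text
    accessions: List[str]  # Genbank accession numbers


def parse_row(line: str) -> ProbesetRow:
    """Parse 'Pkey CAT_number accession accession ...' into a record."""
    fields = line.split()
    return ProbesetRow(int(fields[0]), fields[1], fields[2:])


row = parse_row("456034 142696.1 AW450979 AA136653 AA136656")
print(row.pkey, row.cat_number, len(row.accessions))
```

The CAT number is kept as a string because its punctuation is part of the identifier rather than a decimal fraction.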



TABLE 12C 



Pkey: Unique number corresponding to an Eos probeset 

Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (GI) numbers. "Dunham I. et al." refers to the publication entitled "The DNA sequence of human chromosome 22." Dunham I. et al., Nature (1999) 402:489-495. 
Strand: Indicates DNA strand from which exons were predicted. 
Nt_position: Indicates nucleotide positions of predicted exons. 
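
The Nt_position field lists predicted exons as comma-separated start-end ranges. A minimal sketch of reading such a field is given below; the function names are hypothetical, and the length calculation assumes 1-based inclusive coordinates, which the table itself does not state.

```python
def parse_nt_position(field):
    """Split 'a-b,c-d' into a list of (start, end) integer pairs."""
    exons = []
    for part in field.split(","):
        start, end = part.split("-")
        exons.append((int(start), int(end)))
    return exons


def total_exon_length(exons):
    # Assumption: coordinates are 1-based and inclusive at both ends.
    return sum(end - start + 1 for start, end in exons)


exons = parse_nt_position("24746-24872,25035-25204")
print(exons, total_exon_length(exons))
```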



Pkey Ref Strand Nt_position 

400844 9188605 Plus 24746-24872,25035-25204 

401093 8516137 Minus 22335-23166 

402305 7328724 Plus 40832-41362 

404440 7528051 Plus 80430-81581 

404875 9801324 Plus 96588-96732,97722-97831 

405033 7107731 Minus 142358-142546 



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Table 13 

Seq ID NO: 1 DNA sequence: 

Nucleic Acid Accession #: NM_002205 

Coding sequence: 24..3173 



1 11 21 31 41 51 
| | | | | | 
CAGGACAGGG AAGAGCGGGC GCTATGGGGA GCCGGACGCC AGAGTCCCCT CTCCACGCCG 60 
TGCAGCTGCG CTGGGGCCCC CGGCGCCGAC CCCCGCTCGT GCCGCTGCTG TTGCTGCTCG 120 
TGCCGCCGCC ACCCAGGGTC GGGGGCTTCA ACTTAGACGC GGAGGCCCCA GCAGTACTCT 180 
CGGGGCCCCC GGGCTCCTTC TTCGGATTCT CAGTGGAGTT TTACCGGCCG GGAACAGACG 240 
GGGTCAGTGT GCTGGTGGGA GCACCCAAGG CTAATACCAG CCAGCCAGGA GTGCTGCAGG 300 
GTGGTGCTGT CTACCTCTGT CCTTGGGGTG CCAGCCCCAC ACAGTGCACC CCCATTGAAT 360 
TTGACAGCAA AGGCTCTCGG CTCCTGGAGT CCTCACTGTC CAGCTCAGAG GGAGAGGAGC 420 
CTGTGGAGTA CAAGTCCTTG CAGTGGTTCG GGGCAACAGT TCGAGCCCAT GGCTCCTCCA 480 
TCTTGGCATG CGCTCCACTG TACAGCTGGC GCACAGAGAA GGAGCCACTG AGCGACCCCG 540 
TGGGCACCTG CTACCTCTCC ACAGATAACT TCACCCGAAT TCTGGAGTAT GCACCCTGCC 600 
GCTCAGATTT CAGCTGGGCA GCAGGACAGG GTTACTGCCA AGGAGGCTTC AGTGCCGAGT 660 
TCACCAAGAC TGGCCGTGTG GTTTTAGGTG GACCAGGAAG CTATTTCTGG CAAGGCCAGA 720 
TCCTGTCTGC CACTCAGGAG CAGATTGCAG AATCTTATTA CCCCGAGTAC CTGATCAACC 780 
TGGTTCAGGG GCAGCTGCAG ACTCGCCAGG CCAGTTCCAT CTATGATGAC AGCTACCTAG 840 
GATACTCTGT GGCTGTTGGT GAATTCAGTG GTGATGACAC AGAAGACTTT GTTGCTGGTG 900 
TGCCCAAAGG GAACCTCACT TACGGCTATG TCACCATCCT TAATGGCTCA GACATTCGAT 960 
CCCTCTACAA CTTCTCAGGG GAACAGATGG CCTCCTACTT TGGCTATGCA GTGGCCGCCA 1020 
CAGACGTCAA TGGGGACGGG CTGGATGACT TGCTGGTGGG GGCACCCCTG CTCATGGATC 1080 
GGACCCCTGA CGGGCGGCCT CAGGAGGTGG GCAGGGTCTA CGTCTACCTG CAGCACCCAG 1140 
CCGGCATAGA GCCCACGCCC ACCCTTACCC TCACTGGCCA TGATGAGTTT GGCCGATTTG 1200 
GCAGCTCCTT GACCCCCCTG GGGGACCTGG ACCAGGATGG CTACAATGAT GTGGCCATCG 1260 
GGGCTCCCTT TGGTGGGGAG ACCCAGCAGG GAGTAGTGTT TGTATTTCCT GGGGGCCCAG 1320 
GAGGGCTGGG CTCTAAGCCT TCCCAGGTTC TGCAGCCCCT GTGGGCAGCC AGCCACACCC 1380 
CAGACTTCTT TGGCTCTGCC CTTCGAGGAG GCCGAGACCT GGATGGCAAT GGATATCCTG 1440 
ATCTGATTGT GGGGTCCTTT GGTGTGGACA AGGCTGTGGT ATACAGGGGC CGCCCCATCG 1500 
TGTCCGCTAG TGCCTCCCTC ACCATCTTCC CCGCCATGTT CAACCCAGAG GAGCGGAGCT 1560 
GCAGCTTAGA GGGGAACCCT GTGGCCTGCA TCAACCTTAG CTTCTGCCTC AATGCTTCTG 1620 
GAAAACACGT TGCTGACTCC ATTGGTTTCA CAGTGGAACT TCAGCTGGAC TGGCAGAAGC 1680 
AGAAGGGAGG GGTACGGCGG GCACTGTTCC TGGCCTCCAG GCAGGCAACC CTGACCCAGA 1740 
CCCTGCTCAT CCAGAATGGG GCTCGAGAGG ATTGCAGAGA GATGAAGATC TACCTCAGGA 1800 
ACGAGTCAGA ATTTCGAGAC AAACTCTCGC CGATTCACAT CGCTCTCAAC TTCTCCTTGG 1860 
ACCCCCAAGC CCCAGTGGAC AGCCACGGCC TCAGGCCAGC CCTACATTAT CAGAGCAAGA 1920 
GCCGGATAGA GGACAAGGCT CAGATCTTGC TGGACTGTGG AGAAGACAAC ATCTGTGTGC 1980 
CTGACCTGCA GCTGGAAGTG TTTGGGGAGC AGAACCATGT GTACCTGGGT GACAAGAATG 2040 
CCCTGAACCT CACTTTCCAT GCCCAGAATG TGGGTGAGGG TGGCGCCTAT GAGGCTGAGC 2100 
TTCGGGTCAC CGCCCCTCCA GAGGCTGAGT ACTCAGGACT CGTCAGACAC CCAGGGAACT 2160 
TCTCCAGCCT GAGCTGTGAC TACTTTGCCG TGAACCAGAG CCGCCTGCTG GTGTGTGACC 2220 
TGGGCAACCC CATGAAGGCA GGAGCCAGTC TGTGGGGTGG CCTTCGGTTT ACAGTCCCTC 2280 
ATCTCCGGGA CACTAAGAAA ACCATCCAGT TTGACTTCCA GATCCTCAGC AAGAATCTCA 2340 
ACAACTCGCA AAGCGACGTG GTTTCCTTTC GGCTCTCCGT GGAGGCTCAG GCCCAGGTCA 2400 
CCCTGAACGG TGTCTCCAAG CCTGAGGCAG TGCTATTCCC AGTAAGCGAC TGGCATCCCC 2460 
GAGACCAGCC TCAGAAGGAG GAGGACCTGG GACCTGCTGT CCACCATGTC TATGAGCTCA 2520 
TCAACCAAGG CCCCAGCTCC ATTAGCCAGG GTGTGCTGGA ACTCAGCTGT CCCCAGGCTC 2580 
TGGAAGGTCA GCAGCTCCTA TATGTGACCA GAGTTACGGG ACTCAACTGC ACCACCAATC 2640 
ACCCCATTAA CCCAAAGGGC CTGGAGTTGG ATCCCGAGGG TTCCCTGCAC CACCAGCAAA 2700 
AACGGGAAGC TCCAAGCCGC AGCTCTGCTT CCTCGGGACC TCAGATCCTG AAATGCCCGG 2760 
AGGCTGAGTG TTTCAGGCTG CGCTGTGAGC TCGGGCCCCT GCACCAACAA GAGAGCCAAA 2820 
GTCTGCAGTT GCATTTCCGA GTCTGGGCCA AGACTTTCTT GCAGCGGGAG CACCAGCCAT 2880 
TTAGCCTGCA GTGTGAGGCT GTGTACAAAG CCCTGAAGAT GCCCTACCGA ATCCTGCCTC 2940 
GGCAGCTGCC CCAAAAAGAG CGTCAGGTGG CCACAGCTGT GCAATGGACC AAGGCAGAAG 3000 
GCAGCTATGG CGTCCCACTG TGGATCATCA TCCTAGCCAT CCTGTTTGGC CTCCTGCTCC 3060 
TAGGTCTACT CATCTACATC CTCTACAAGC TTGGATTCTT CAAACGCTCC CTCCCATATG 3120 
GCACCGCCAT GGAAAAAGCT CAGCTCAAGC CTCCAGCCAC CTCTGATGCC TGAGTCCTCC 3180 
CAATTTCAGA CTCCCATTCC TGAAGAACCA GTCCCCCCAC CCTCATTCTA CTGAAAAGGA 3240 
GGGGTCTGGG TACTTCTTGA AGGTGCTGAC GGCCAGGGAG AAGCTCCTCT CCCCAGCCCA 3300 
GAGACATACT TGAAGGGCCA GAGCCAGGGG GGTGAGGAGC TGGGGATCCC TCCCCCCCAT 3360 
GCACTGTGAA GGACCCTTGT TTACACATAC CCTCTTCATG GATGGGGGAA CTCAGATCCA 3420 
GGGACAGAGG CCCAGCCTCC CTGAAGCCTT TGCATTTTGG AGAGTTTCCT GAAACAACTG 3480 
GAAAGATAAC TAGGAAATCC ATTCACAGTT CTTTGGGCCA GACATGCCAC AAGGACTTCC 3540 
TGTCCAGCTC CAACCTGCAA AGATCTGTCC TCAGCCTTGC CAGAGATCCA AAAGAAGCCC 3600 
CCAGTAAGAA CCTGGAACTT QGGGAGTTAA GACCTGGCAG CTCTGGACAG CCCCACCCTG 3660 
GTGGGCCAAC AAAGAACACT AACTATGCAT GGTGCCCCAG GACCAGCTCA GGACAGATGC 3720 
CACAAGGATA GATGCTGGCC CAGGGCCAGA GCCCAGCTCC AAGGGGAATC AGAACTCAAA 3780 
TGGGGCCAGA TCCAGCCTGG GGTCTGGAGT TGATCTGGAA CCCAGACTCA GACATTGGCA 3840 
CCAATCCAGG CAGATCCAGG ACTATATTTG GGCCTGCTCC AGACCTGATC CTGGAGGCCC 3900 
AGTTCACCCT GATTTAGGAG AAGCCAGGAA TTTCCCAGGA CCTGAAGGGG CCATGATGGC 3960 
AACAGATCTG GAACCTCAGC CTGGCCAGAC ACAGGCCCTC CCTGTTCCCC AGAGAAAGGG 4020 
GAGCCCACTG TCCTGGGCCT GCAGAATTTG GGTTCTGCCT GCCAGCTGCA CTGATGCTGC 4080 
CCCTCATCTC TCTGCCCAAC CCTTCCCTCA CCTTGGCACC AGACACCCAG GACTTATTTA 4140 
AACTCTGTTG CAAGTGCAAT AAATCTGACC CAGTGCCCCC ACTGACCAGA ACTAGAAAAA 4200 
AAAA 



Seq ID NO: 2 Protein sequence: 
Protein Accession #: NP_002196.1 

1 11 21 31 41 51 
| | | | | | 
MGSRTPESPL HAVQLRWGPR RRPPLVPLLL LLVPPPPRVG GFNLDAEAPA VLSGPPGSFF 60 
GFSVEFYRPG TDGVSVLVGA PKANTSQPGV LQGGAVYLCP WGASPTQCTP IEFDSKGSRL 120 
LESSLSSSEG EEPVEYKSLQ WFGATVRAHG SSILACAPLY SWRTEKEPLS DPVGTCYLST 180 
DNFTRILEYA PCRSDFSWAA GQGYCQGGFS AEFTKTGRVV LGGPGSYFWQ GQILSATQEQ 240 
IAESYYPEYL INLVQGQLQT RQASSIYDDS YLGYSVAVGE FSGDDTEDFV AGVPKGNLTY 300 
GYVTILNGSD IRSLYNFSGE QMASYFGYAV AATDVNGDGL DDLLVGAPLL MDRTPDGRPQ 360 
EVGRVYVYLQ HPAGIEPTPT LTLTGHDEFG RFGSSLTPLG DLDQDGYNDV AIGAPFGGET 420 
QQGVVFVFPG GPGGLGSKPS QVLQPLWAAS HTPDFFGSAL RGGRDLDGNG YPDLIVGSFG 480 
VDKAVVYRGR PIVSASASLT IFPAMFNPEE RSCSLEGNPV ACINLSFCLN ASGKHVADSI 540 
GFTVELQLDW QKQKGGVRRA LFLASRQATL TQTLLIQNGA REDCREMKIY LRNESEFRDK 600 
LSPIHIALNF SLDPQAPVDS HGLRPALHYQ SKSRIEDKAQ ILLDCGEDNI CVPDLQLEVF 660 
GEQNHVYLGD KNALNLTFHA QNVGEGGAYE AELRVTAPPE AEYSGLVRHP GNFSSLSCDY 720 
FAVNQSRLLV CDLGNPMKAG ASLWGGLRFT VPHLRDTKKT IQFDFQILSK NLNNSQSDVV 780 
SFRLSVEAQA QVTLNGVSKP EAVLFPVSDW HPRDQPQKEE DLGPAVHHVY ELINQGPSSI 840 
SQGVLELSCP QALEGQQLLY VTRVTGLNCT TNHPINPKGL ELDPEGSLHH QQKREAPSRS 900 
SASSGPQILK CPEAECFRLR CELGPLHQQE SQSLQLHFRV WAKTFLQREH QPFSLQCEAV 960 
YKALKMPYRI LPRQLPQKER QVATAVQWTK AEGSYGVPLW IIILAILFGL LLLGLLIYIL 1020 
YKLGFFKRSL PYGTAMEKAQ LKPPATSDA 

Seq ID NO: 3 DNA sequence: 
Nucleic Acid Accession #: NM_002425 
Coding sequence: 23..1453 

1 11 21 31 41 51 

| | | | | | 

AAAGAAGGTA AGGGCAGTGA GAATGATGCA TCTTGCATTC CTTGTGCTGT TGTGTCTGCC 60 

AGTCTGCTCT GCCTATCCTC TGAGTGGGGC AGCAAAAGAG GAGGACTCCA ACAAGGATCT 120 

TGCCCAGCAA TACCTAGAAA AGTACTACAA CCTCGAAAAG GATGTGAAAC AGTTTAGAAG 180 

AAAGGACAGT AATCTCATTG TTAAAAAAAT CCAAGGAATG CAGAAGTTCC TTGGGTTGGA 240 

GGTGACAGGG AAGCTAGACA CTGACACTCT GGAGGTGATG CGCAAGCCCA GGTGTGGAGT 300 

TCCTGACGTT GGTCACTTCA GCTCCTTTCC TGGCATGCCG AAGTGGAGGA AAACCCACCT 360 

TACATACAGG ATTGTGAATT ATACACCAGA TTTGCCAAGA GATGCTGTTG ATTCTGCCAT 420 

TGAGAAAGCT CTGAAAGTCT GGGAAGAGGT GACTCCACTC ACATTCTCCA GGCTGTATGA 480 

AGGAGAGGCT GATATAATGA TCTCTTTCGC AGTTAAAGAA CATGGAGACT TTTACTCTTT 540 

TGATGGCCCA GGACACAGTT TGGCTCATGC CTACCCACCT GGACCTGGGC TTTATGGAGA 600 

TATTCACTTT GATGATGATG AAAAATGGAC AGAAGATGCA TCAGGCACCA ATTTATTCCT 660 

CGTTGCTGCT CATGAACTTG GCCACTCCCT GGGGCTCTTT CACTCAGCCA ACACTGAAGC 720 

TTTGATGTAC CCACTCTACA ACTCATTCAC AGAGCTCGCC CAGTTCCGCC TTTCGCAAGA 780 

TGATGTGAAT GGCATTCAGT CTCTCTACGG ACCTCCCCCT GCCTCTACTG AGGAACCCCT 840 

GGTGCCCACA AAATCTGTTC CTTCGGGATC TGAGATGCCA GCCAAGTGTG ATCCTGCTTT 900 

GTCCTTCGAT GCCATCAGCA CTCTGAGGGG AGAATATCTG TTCTTTAAAG ACAGATATTT 960 

TTGGCGAAGA TCCCACTGGA ACCCTGAACC TGAATTTCAT TTGATTTCTG CATTTTGGCC 1020 

CTCTCTTCCA TCATATTTGG ATGCTGCATA TGAAGTTAAC AGCAGGGACA CCGTTTTTAT 1080 

TTTTAAAGGA AATGAGTTCT GGGCCATCAG AGGAAATGAG GTACAAGCAG GTTATCCAAG 1140 

AGGCATCCAT ACCCTGGGTT TTCCTCCAAC CATAAGGAAA ATTGATGCAG CTGTTTCTGA 1200 

CAAGGAAAAG AAGAAAACAT ACTTCTTTGC AGCGGACAAA TACTGGAGAT TTGATGAAAA 1260 

TAGCCAGTCC ATGGAGCAAG GCTTCCCTAG ACTAATAGCT GATGACTTTC CAGGAGTTGA 1320 

GCCTAAGGTT GATGCTGTAT TACAGGCATT TGGATTTTTC TACTTCTTCA GTGGATCATC 1380 

ACAGTTTGAG TTTGACCCCA ATGCCAGGAT GGTGACACAC ATATTAAAGA GTAACAGCTG 1440 

GTTACATTGC TAGGCGAGAT AGGGGGAAGA CAGATATGGG TGTTTTTAAT AAATCTAATA 1500 

ATTATTCATC TAATGTATTA TGAGCCAAAA TGGTTAATTT TTCCTGCATG TTCTGTGACT 1560 

GAAGAAGATG AGCCTTGCAG ATATCTGCAT GTGTCATGAA GAATGTTTCT GGAATTCTTC 1620 

ACTTGCTTTT GAATTGCACT GAACAGAATT AAGAAATACT CATGTGCAAT AGGTGAGAGA 1680 

ATGTATTTTC ATAGATGTGT TATTACTTCC TCAATAAAAA GTTTTATTTT GGGCCTGTTC 1740 
CTT 
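
The "Coding sequence" line gives the 1-based nucleotide position at which translation begins. As a hedged sketch of how that annotation relates the DNA listing to the protein listing, the code below translates the first 60-nucleotide row of Seq ID NO: 3 starting at position 23. It assumes the standard genetic code; `translate_from` is a hypothetical helper, not part of the disclosure.

```python
BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate_from(seq, cds_start):
    """Translate complete codons from a 1-based start until a stop codon."""
    seq = seq.replace(" ", "").upper()
    protein = []
    for i in range(cds_start - 1, len(seq) - 2, 3):
        residue = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks unreadable codons
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


first_row = "AAAGAAGGTA AGGGCAGTGA GAATGATGCA TCTTGCATTC CTTGTGCTGT TGTGTCTGCC"
print(translate_from(first_row, 23))  # prints MMHLAFLVLLCL
```

The printed residues agree with the opening of the Seq ID NO: 4 protein listing, which is a quick consistency check between the two sequences.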






Seq ID NO: 4 Protein sequence: 
Protein Accession #: NP_002416 



1 11 21 31 41 51 

I I I I I I 

MMHLAFLVLL CLPVCSAYPL SGAAKEEDSN KDLAQQYLEK YYNLEKDVKQ FRRKDSNLIV 60 

KKIQGMQKFL GLEVTGKLDT DTLEVMRKPR CGVPDVGHFS SFPGMPKHRK THLTYRIVNY 120 

TPDJLiPRDAVD SAIEKALKVW EEVTPLTFSR LYEGEADIMI SFAVKEHGDF YSFDGPGHSL 180 

75 AHAYPPGPGL YGDIHFDDDE KWTEDASGTN LFLVAAHELG HSLGLFHSAN TEALMYPLYN 240 

SFTELAQFRL SQDDVNGIQS LYGPP PASTE EPLVPTKSVP SGSEMPAKCD PALSFDAIST 300 

LRGEYLFFKD RYFWRRSHWN PEPEFHliISA FWPSLPSYLD AAYEVNSRDT VFIFKGNEFW 360 

AIRGNEVQAG YPRGIHTLGF PPTIRKIDAA VSDKEKKKTY FFAADKYWRF DENSQSMEQG 420 

FPRLIADDFP GVEPKVDAVL QAFGFFYFFS GSSQFEFDPN ARMVTHILKS NSWLHC 

Seq ID NO: 5 DNA sequence
Nucleic Acid Accession #: NM_002421
Coding sequence: 72..1481

1 11 21 31 41 51
| | | | | |
GGGATATTGG AGTAGCAAGA GGCTGGGAAG CCATCACTTA CCTTGCACTG AGAAAGAAGA 60
CAAAGGCCAG TATGCACAGC TTTCCTCCAC TGCTGCTGCT GCTGTTCTGG GGTGTGGTGT 120
CTCACAGCTT CCCAGCGACT CTAGAAACAC AAGAGCAAGA TGTGGACTTA GTCCAGAAAT 180
ACCTGGAAAA ATACTACAAC CTGAAGAATG ATGGGAGGCA AGTTGAAAAG CGGAGAAATA 240
GTGGCCCAGT GGTTGAAAAA TTGAAGCAAA TGCAGGAATT CTTTGGGCTG AAAGTGACTG 300
GGAAACCAGA TGCTGAAACC CTGAAGGTGA TGAAGCAGCC CAGATGTGGA GTGCCTGATG 360
TGGCTCAGTT TGTCCTCACT GAGGGGAACC CTCGCTGGGA GCAAACACAT CTGACCTACA 420
GGATTGAAAA TTACACGCCA GATTTGCCAA GAGCAGATGT GGACCATGCC ATTGAGAAAG 480
CCTTCCAACT CTGGAGTAAT GTCACACCTC TGACATTCAC CAAGGTCTCT GAGGGTCAAG 540
CAGACATCAT GATATCTTTT GTCAGGGGAG ATCATCGGGA CAACTCTCCT TTTGATGGAC 600
CTGGAGGAAA TCTTGCTCAT GCTTTTCAAC CAGGCCCAGG TATTGGAGGG GATGCTCATT 660
TTGATGAAGA TGAAAGGTGG ACCAACAATT TCAGAGAGTA CAACTTACAT CGTGTTGCGG 720
CTCATGAACT CGGCCATTCT CTTGGACTCT CCCATTCTAC TGATATCGGG GCTTTGATGT 780
ACCCTAGCTA CACCTTCAGT GGTGATGTTC AGCTAGCTCA GGATGACATT GATGGCATCC 840
AAGCCATATA TGGACGTTCC CAAAATCCTG TCCAGCCCAT CGGCCCACAA ACCCCAAAAG 900
CGTGTGACAG TAAGCTAACC TTTGATGCTA TAACTACGAT TCGGGGAGAA GTGATGTTCT 960
TTAAAGACAG ATTCTACATG CGCACAAATC CCTTCTACCC GGAAGTTGAG CTCAATTTCA 1020
TTTCTGTTTT CTGGCCACAA CTGCCAAATG GGCTTGAAGC TGCTTACGAA TTTGCCGACA 1080
GAGATGAAGT CCGGTTTTTC AAAGGGAATA AGTACTGGGC TGTTCAGGGA CAGAATGTGC 1140
TACACGGATA CCCCAAGGAC ATCTACAGCT CCTTTGGCTT CCCTAGAACT GTGAAGCATA 1200
TCGATGCTGC TCTTTCTGAG GAAAACACTG GAAAAACCTA CTTCTTTGTT GCTAACAAAT 1260
ACTGGAGGTA TGATGAATAT AAACGATCTA TGGATCCAGG TTATCCCAAA ATGATAGCAC 1320
ATGACTTTCC TGGAATTGGC CACAAAGTTG ATGCAGTTTT CATGAAAGAT GGATTTTTCT 1380
ATTTCTTTCA TGGAACAAGA CAATACAAAT TTGATCCTAA AACGAAGAGA ATTTTGACTC 1440
TCCAGAAAGC TAATAGCTGG TTCAACTGCA GGAAAAATTG AACATTACTA ATTTGAATGG 1500
AAAACACATG GTGTGAGTCC AAAGAAGGTG TTTTCCTGAA GAACTGTCTA TTTTCTCAGT 1560
CATTTTTAAC CTCTAGAGTC ACTGATACAC AGAATATAAT CTTATTTATA CCTCAGTTTG 1620
CATATTTTTT TACTATTTAG AATGTAGCCC TTTTTGTACT GATATAATTT AGTTCCACAA 1680
ATGGTGGGTA CAAAAAGTCA AGTTTGTGGC TTATGGATTC ATATAGGCCA GAGTTGCAAA 1740
GATCTTTTCC AGAGTATGCA ACTCTGACGT TGATCCCAGA GAGCAGCTTC AGTGACAAAC 1800
ATATCCTTTC AAGACAGAAA GAGACAGGAG ACATGAGTCT TTGCCGGAGG AAAAGCAGCT 1860
CAAGAACACA TGTGCAGTCA CTGGTGTCAC CCTGGATAGG CAAGGGATAA CTCTTCTAAC 1920
ACAAAATAAG TGTTTTATGT TTGGAATAAA GTCAACCTTG TTTCTACTGT TTT



Seq ID NO: 6 Protein sequence: 
Protein Accession #: NP_002412 



1 11 21 31 41 51
| | | | | |
MHSFPPLLLL LFWGVVSHSF PATLETQEQD VDLVQKYLEK YYNLKNDGRQ VEKRRNSGPV 60
VEKLKQMQEF FGLKVTGKPD AETLKVMKQP RCGVPDVAQF VLTEGNPRWE QTHLTYRIEN 120
YTPDLPRADV DHAIEKAFQL WSNVTPLTFT KVSEGQADIM ISFVRGDHRD NSPFDGPGGN 180
LAHAFQPGPG IGGDAHFDED ERWTNNFREY NLHRVAAHEL GHSLGLSHST DIGALMYPSY 240
TFSGDVQLAQ DDIDGIQAIY GRSQNPVQPI GPQTPKACDS KLTFDAITTI RGEVMFFKDR 300
FYMRTNPFYP EVELNFISVF WPQLPNGLEA AYEFADRDEV RFFKGNKYWA VQGQNVLHGY 360
PKDIYSSFGF PRTVKHIDAA LSEENTGKTY FFVANKYWRY DEYKRSMDPG YPKMIAHDFP 420
GIGHKVDAVF MKDGFFYFFH GTRQYKFDPK TKRILTLQKA NSWFNCRKN



Seq ID NO: 7 DNA sequence

Nucleic Acid Accession #: NM_014331

Coding sequence: 1..1506

1 11 21 31 41 51

| | | | | |

ATGGTCAGAA AGCCTGTTGT GTCCACCATC TCCAAAGGAG GTTACCTGCA GGGAAATGTT 60

AACGGGAGGC TGCCTTCCCT GGGCAACAAG GAGCCACCTG GGCAGGAGAA AGTGCAGCTG 120

AAGAGGAAAG TCACTTTACT GAGGGGAGTC TCCATTATCA TTGGCACCAT CATTGGAGCA 180

GGAATCTTCA TCTCTCCTAA GGGCGTGCTC CAGAACACGG GCAGCGTGGG CATGTCTCTG 240

ACCATCTGGA CGGTGTGTGG GGTCCTGTCA CTATTTGGAG CTTTGTCTTA TGCTGAATTG 300

GGAACAACTA TAAAGAAATC TGGAGGTCAT TACACATATA TTTTGGAAGT CTTTGGTCCA 360

TTACCAGCTT TTGTACGAGT CTGGGTGGAA CTCCTCATAA TACGCCCTGC AGCTACTGCT 420

GTGATATCCC TGGCATTTGG ACGCTACATT CTGGAACCAT TTTTTATTCA ATGTGAAATC 480

CCTGAACTTG CGATCAAGCT CATTACAGCT GTGGGCATAA CTGTAGTGAT GGTCCTAAAT 540

AGCATGAGTG TCAGCTGGAG CGCCCGGATC CAGATTTTCT TAACCTTTTG CAAGCTCACA 600

GCAATTCTGA TAATTATAGT CCCTGGAGTT ATGCAGCTAA TTAAAGGTCA AACGCAGAAC 660

TTTAAAGACG CGTTTTCAGG AAGAGATTCA AGTATTACGC GGTTGCCACT GGCTTTTTAT 720

TATGGAATGT ATGCATATGC TGGCTGGTTT TACCTCAACT TTGTTACTGA AGAAGTAGAA 780 

AACCCTGAAA AAACCATTCC CCTTGCAATA TGTATATCCA TGGCCATTGT CACCATTGGC 840 

TATGTQCTGA CAAATGTGGC CTACTTTACG ACCATTAATG CTGAGGAGCT GCTGCTTTCA 900 

AA 7 ^ GTGG CAGTGACCTT TTCTGAGCGG CTACTGGGAA ATTTCTCATT AGCAGTTCCG 960 

ATCTTTGTTG CCCTCTCCTG CTTTGGCTCC ATCAACGGTG GTGTGTTTGC TGTCTCCAGG 1020 

TTATTCTATG TTGCGTCTCG AGAGGGTCAC CTTCCAGAAA TCCTCTCCAT GATTCATGTC 1080 

CGCAAGCACA CTCCTCTACC AGCTGTTATT GTTTTGCACC CTTTGACAAT GATAATGCTC 1140

TTCTCTGGAG ACCTCGACAG TCTTTTGAAT TTCCTCAGTT TTGCCAGGTG GCTTTTTATT 1200 

GGGCTGGCAG TTGCTGGGCT GATTTATCTT CGATACAAAT GCCCAGATAT GCATCGTCCT 1260 

TTCAAGGTGC CACTGTTCAT CCCAGCTTTG TTTTCCTTCA CATGCCTCTT CATGGTTGCC 1320 

CTTTCCCTCT ATTCGGACCC ATTTAGTACA GGGATTGGCT TCGTCATCAC TCTGACTGGA 1380 

GTCCCTGCGT ATTATCTCTT TATTATATGG GACAAGAAAC CCAGGTGGTT TAGAATAATG 1440 

TCAGAGAAAA TAACCAGAAC ATTACAAATA ATACTGGAAG TTGTACCAGA AGAAGATAAG 1500 

TTA ~ ACTA ATGGACTTGA GATCTTGGCA ATCTGCCCAA GGGGAGACAC AAAATAGGGA 1560 

TTTTTACTTC ATTTTCTGAA AGTCTAGAGA ATTACAACTT TGGTGATAAA CAAAAGGAGT 1620 

CAGTTATTTT TATTCATATA TTTTAGCATA TTCGAACTAA TTTCTAAGAA ATTTAGTTAT 1680 

AACTCTATGT AGTTATAGAA AGTGAATATG CAGTTATTCT ATGAGTCGCA CAATTCTTGA 1740 

GTCTCTGATA CCTACCTATT GGGGTTAGGA GAAAAGACTA GACAATTACT ATGTGGTCAT 1800 

TCTCTACAAC ATATGTTAGC ACGGCAAAGA ACCTTCAAAT TGAAGACTGA GATTTTTCTG 1860 

TATATATGGG TTTTGTAAAG ATGGTTTTAC ACACTACAGA TGTCTATACT GTGAAAAGTG 1920 

TTTTCAATTC TGAAAAAAAG CATACATCAT GATTATGGCA AAGAGGAGAG AAAGAAATTT 1980 

ATTTTACATT GACATTGCAT TGCTTCCCCT TAGATACCAA TTTAGATAAC AAACACTCAT 2040 

GCTTTAATGG ATTATACCCA GAGCACTTTG AACAAAGGTC AGTGGGGATT GTTGAATACA 2100 

TTAAAGAAGA GTTTCTAGGG GCTACTGTTT ATGAGACACA TCCAGGAGTT ATGTTTAAGT 2160 

AAAAATCCTT GAGAATTTAT TATGTCAGAT GTTTTTTCAT TCATTATCAG GAAGTTTTAG 2220 

TTATCTGTCA TTTTTTTTTT TCACATCAGT TTGATCAGGA AAGTGTATAA CACATCTTAG 2280 

AGCAAGAGTT AGTTTGGTAT TAAATCCTCA TTAGAACAAC CACCTGTTTC ACTAATAACT 2340 

TACCCCTGAT GAGTCTATCT AAACATATGC ATTTTAAGCC TTCAAATTAC ATTATCAACA 2400 

TGAGAGAAAT AACCAACAAA GAAGATGTTC AAAATAATAG TCCCATATCT GTAATCATAT 2460 

CTACATGCAA TGTTAGTAAT TCTGAAGTTT TTTAAATTTA TGGCTATTTT TACACGATGA 2520 

TGAATTTTGA CAGTTTGTGC ATTTTCTTTA TACATTTTAT ATTCTTCTGT TAAAATATCT 2580 

CTTCAGATGA AACTGTCCAG ATTAATTAGG AAAAGGCATA TATTAACATA AAAATTGCAA 2640 

AAGAAATGTC GCTGTAAATA AGATTTACAA CTGATGTTTC TAGAAAATTT CCACTTCTAT 2700 

ATCTAGGCTT TGTCAGTAAT TTCCACACCT TAATTATCAT TCAACTTGCA AAAGAGACAA 2760 

CTGATAAGAA GAAAATTGAA ATGAGAATCT GTGGATAAGT GTTTGTGTTC AGAAGATGTT 2820 

GTTTTGCCAG TATTAGAAAA TACTGTGAGC CGGGCATGGT GGCTTACATC TGTAATCCCA 2880 

GCACTTTGGG AGGCTGAGGG GGTGGATCAC CTGAGGTCGG GAGTTCTAGA CCAGCCTGAC 2940 

CAACATGGAG AAACCCCATC TCTACTAAAA ATACAAAATT AGCTGGGCAT GGTGGCACAT 3000 

GCTGGTAATC TCAGCTATTG AGGAGGCTGA GGCAGGAGAA TTGCTTGAAC CCGGGAGGCG 3060 

GAGGTTGCAG TGAGCCAAGA TTGCACCACT GTACTCCAGC CTGGGTGACA AAGTCAGACT 3120 
CCATCTCCAA AAAAAAAAAA AAAA 



Seq ID NO: 8 Protein sequence:
Protein Accession #: NP_055146

1 11 21 31 41 51

| | | | | |

MVRKPVVSTI SKGGYLQGNV NGRLPSLGNK EPPGQEKVQL KRKVTLLRGV SIIIGTIIGA 60

GIFISPKGVL QNTGSVGMSL TIWTVCGVLS LFGALSYAEL GTTIKKSGGH YTYILEVFGP 120

LPAFVRVWVE LLIIRPAATA VISLAFGRYI LEPFFIQCEI PELAIKLITA VGITVVMVLN 180

SMSVSWSARI QIFLTFCKLT AILIIIVPGV MQLIKGQTQN FKDAFSGRDS SITRLPLAFY 240

YGMYAYAGWF YLNFVTEEVE NPEKTIPLAI CISMAIVTIG YVLTNVAYFT TINAEELLLS 300

NAVAVTFSER LLGNFSLAVP IFVALSCFGS MNGGVFAVSR LFYVASREGH LPEILSMIHV 360

RKHTPLPAVI VLHPLTMIML FSGDLDSLLN FLSFARWLFI GLAVAGLIYL RYKCPDMHRP 420

FKVPLFIPAL FSFTCLFMVA LSLYSDPFST GIGFVITLTG VPAYYLFIIW DKKPRWFRIM 480
SEKITRTLQI ILEVVPEEDK L

Seq ID NO: 9 DNA sequence
Nucleic Acid Accession #: FGENESH
Coding sequence: 1..546

1 11 21 31 41 51

| | | | | |

ATGGCCTTGG GCAGCTCCGC CCCTGTGGCT TTGCAGGGTA ATGCCCACTT CCCTGCTGCT 60 

TTCATGGCTG GCATTAAGTG TCTGTGGCTT TTCCAGGTAG TCCCCCTGGG GCTCCCCGAG 120 

TTGGTGCAAA GGCTCCTGGG TGGAGCTCGA ACTGAAACTC GCTTTGTGCC CGCAGCCCTG 180 

CAGCTCGCCG GTGCCCTCGA CCTGCCCGCT GGGTCCTGTG CCTTTGAAGA GAGCACTTGC 240 

GGCTTTGACT CCGTGTTGGC CTCTCTGCCG TGGATTTTAA ATGAGGAAGG CCAGCAACCT 300 

TTCTGGTCCT CAGGAGACAT GTCTGACTGG GACTACTGGG TTGGCTGGCG GAAGTTAATT 360 

CATTCTCCTC TGAGCACTCC AGGGTGGAGC AGGCAGGTTA GGCTCCAGTT GTTCCAGCTT 420 

CAGTTTGTCA AAGGCCAGAA CTTGGACGTA ACAGTGTACT GCAGGCTCCA GGGCAGTGAG 480 

AAACCCTTTG AAACTGGTTC CATGGTTCCA TTCACCTTCA TGTACTGGAT CCACCATGGA 540 
AAGTAG 

Seq ID NO: 10 Protein sequence:
Protein Accession #: FGENESH

1 11 21 31 41 51

| | | | | |

MALGSSAPVA LQGNAHFPAA FMAGIKCLWL FQVVPLGLPE LVQRLLGGAR TETRFVPAAL 60
QLAGALDLPA GSCAFEESTC GFDSVLASLP WILNEEGQQP FWSSGDMSDW DYWVGWRKLI 120
HSPLSTPGWS RQVRLQLFQL QFVKGQNLDV TVYCRLQGSE KPFETGSMVP FTFMYWIHHG 180
K



Seq ID NO: 11 DNA sequence

Nucleic Acid Accession #: EOS sequence

Coding sequence: 461-3286

1 11 21 31 41 51

| | | | | |

GCCGCCCCTG AGCAGCCTCG CCTTCGCCTC CCGCGTTTCC TGCCGTCCGC CCTCCCCCGG 60

CCGAGCTCCA GGGGCTGCCG CCTAGCAGCT CCCGGCGGGA GAGCGGTTCA GAGCGCGCAC 120

GGGGGCGGGC GGAGGCGGCC CGGTGCGGGG CGGCCGCGCT GGAGAGAGGC GCGCGCGGAG 180

ACGCCGGCCC CCCTCTCCGC GTTTGCTCGC TTGCTCCCCG CCTCCCGCAC TCCGCTCGCT 240

CCCACCCCTT CCCGGCGTGA TTGATCCGTC ACGGGCGCCG CCGCTGCCGC CGCCGCCGCC 300

GCGGCCGTTC TGAGCCGAGC CGGAACCCTA GCCCGAGACG GAGCCGGGGC CCGGGCCGGC 360

GCCATTGCGC GGGCGCCGCG GGAAGACCTT GGCGCGGGGC GGCGGGCCGG GCCAGGCCAT 420

GCGGGCCGAG TGAGCCGGCG CCCGCAGCCC GCGGCGCGGC ATGGCTTCCC CGCGGAGCTC 480

CGGGCAGCCC GGGCCGCCGC CGCCGCCGCC ACCGCCGCCC GCGCGCCTGC TACTGCTACT 540

GCTGCTGCCG CTGCTGCTGC CTCTGGCGCC CGGGGCCTGG GGCTGGGCGC GGGGCGCCCC 600

CCGGCCGCCG CCCAGCAGCC CGCCGCTCTC CATCATGGGC CTCATGCCGC TCACCAAGGA 660

GGTGGCCAAG GGCAGCATCG GGCGCGGTGT GCTCCCCGCC GTGGAACTGG CCATCGAGCA 720

GATCCGCAAC GAGTCACTCC TGCGCCCCTA CTTCCTCGAC CTGCGGCTCT ATGACACGGA 780

GTGCGACAAC GCAAAAGGGT TGAAAGCCTT CTACGATGCA ATAAAATACG GGCCTAACCA 840

CTTGATGGTG TTTGGAGGCG TCTGTCCATC CGTCACATCC ATCATTGCAG AGTCCCTCCA 900

AGGCTGGAAT CTGGTGCAGC TTTCTTTTGC TGCAACCACG CCTGTTCTAG CCGATAAGAA 960

AAAATACCCT TATTTCTTTC GGACCGTCCC ATCAGACAAT GCGGTGAATC CAGCCATTCT 1020

GAAGTTGCTC AAGCACTACC AGTGGAAGCG CGTGGGCACG CTGACGCAAG ACGTTCAGAG 1080

GTTCTCTGAG GTGCGGAATG ACCTGACTGG AGTTCTGTAT GGCGAGGACA TTGAGATTTC 1140

AGACACCGAG AGCTTCTCCA ACGATCCCTG TACCAGTGTC AAAAAGCTGA AGGGGAATGA 1200

TGTGCGGATC ATCCTTGGCC AGTTTGACCA GAATATGGCA GCAAAAGTGT TCTGTTGTGC 1260

ATACGAGGAG AACATGTATG GTAGTAAATA TCAGTGGATC ATTCCGGGCT GGTACGAGCC 1320

TTCTTGGTGG GAGCAGGTGC ACACGGAAGC CAACTCATCC CGCTGCCTCC GGAAGAATCT 1380

GCTTGCTGCC ATGGAGGGCT ACATTGGCGT GGATTTCGAG CCCCTGAGCT CCAAGCAGAT 1440

CAAGACCATC TCAGGAAAGA CTCCACAGCA GTATGAGAGA GAGTACAACA ACAAGCGGTC 1500

AGGCGTGGGG CCCAGCAAGT TCCACGGGTA CGCCTACGAT GGCATCTGGG TCATCGCCAA 1560

GACACTGCAG AGGGCCATGG AGACACTGCA TGCCAGCAGC CGGCACCAGC GGATCCAGGA 1620

CTTCAACTAC ACGGACCACA CGCTGGGCAG GATCATCCTC AATGCCATGA ACGAGACCAA 1680

CTTCTTCGGG GTCACGGGTC AAGTTGTATT CCGGAATGGG GAGAGAATGG GGACCATTAA 1740

ATTTACTCAA TTTCAAGACA GCAGGGAGGT GAAGGTGGGA GAGTACAACG CTGTGGCCGA 1800

CACACTGGAG ATCATCAATG ACACCATCAG GTTCCAAGGA TCCGAACCAC CAAAAGACAA 1860

GACCATCATC CTGGAGCAGC TGCGGAAGAT CTCCCTACCT CTCTACAGCA TCCTCTCTGC 1920

CCTCACCATC CTCGGGATGA TCATGGCCAG TGCTTTTCTC TTCTTCAACA TCAAGAACCG 1980

GAATCAGAAG CTCATAAAGA TGTCGAGTCC ATACATGAAC AACCTTATCA TCCTTGGAGG 2040

GATGCTCTCC TATGCTTCCA TATTTCTCTT TGGCCTTGAT GGATCCTTTG TCTCTGAAAA 2100

GACCTTTGAA ACACTTTGCA CCGTCAGGAC CTGGATTCTC ACCGTGGGCT ACACGACCGC 2160

TTTTGGGGCC ATGTTTGCAA AGACCTGGAG AGTCCACGCC ATCTTCAAAA ATGTGAAAAT 2220

GAAGAAGAAG ATCATCAAGG ACCAGAAACT GCTTGTGATC GTGGGGGGCA TGCTGCTGAT 2280

CGACCTGTGT ATCCTGATCT GCTGGCAGGC TGTGGACCCC CTGCGAAGGA CAGTGGAGAA 2340

GTACAGCATG GAGCCGGACC CAGCAGGACG GGATATCTCC ATCCGCCCTC TCCTGGAGCA 2400

CTGTGAGAAC ACCCATATGA CCATCTGGCT TGGCATCGTC TATGCCTACA AGGGACTTCT 2460

CATGTTGTTC GGTTGTTTCT TAGCTTGGGA GACCCGCAAC GTCAGCATCC CCGCACTCAA 2520

CGACAGCAAG TACATCGGGA TGAGTGTCTA CAACGTGGGG ATCATGTGCA TCATCGGGGC 2580

CGCTGTCTCC TTCCTGACCC GGGACCAGCC CAATGTGCAG TTCTGCATCG TGGCTCTGGT 2640

CATCATCTTC TGCAGCACCA TCACCCTCTG CCTGGTATTC GTGCCGAAGC TCATCACCCT 2700

GAGAACAAAC CCAGATGCAG CAACGCAGAA CAGGCGATTC CAGTTCACTC AGAATCAGAA 2760

GAAAGAAGAT TCTAAAACGT CCACCTCGGT CACCAGTGTG AACCAAGCCA GCACATCCCG 2820

CCTGGAGGGC CTACAGTCAG AAAACCATCG CCTGCGAATG AAGATCACAG AGCTGGATAA 2880

AGACTTGGAA GAGGTCACCA TGCAGCTGCA GGACACACCA GAAAAGACCA CCTACATTAA 2940

ACAGAACCAC TACCAAGAGC TCAATGACAT CCTCAACCTG GGAAACTTCA CTGAGAGCAC 3000

AGATGGAGGA AAGGCCATTT TAAAAAATCA CCTCGATCAA AATCCCCAGC TACAGTGGAA 3060

CACAACAGAG CCCTCTCGAA CATGCAAAGA TCCTATAGAA GATATAAACT CTCCAGAACA 3120

CATCCAGCGT CGGCTGTCCC TCCAGCTCCC CATCCTCCAC CACGCCTACC TCCCATCCAT 3180

CGGAGGCGTG GACGCCAGCT GTGTCAGCCC CTGCGTCAGC CCCACCGCCA GCCCCCGCCA 3240

CAGACATGTG CCACCCTCCT TCCGAGTCAT GGTCTCGGGC CTGTAAGGGT GGGAGGCCTG 3300

GGCCCGGGGC CTCCCCCGTG ACAGAACCAC ACTGGGCAGA GGGGTCTGCT GCAGAAACAC 3360

TGTCGGCTCT GGCTGCGGAG AAGCTGGGCA CCATGGCTGG CCTCTCAGGA CCACTCGGAT 3420

GGCACTCAGG TGGACAGGAC GGGGCAGGGG GAGACTTGGC ACCTGACCTC GAGCCTTATT 3480

TGTGAAGTCC TTATTTCTTC ACAAAGAAGA GGAACGGAAA TGGGACGTCT TCCTTAACAT 3540

CTGCAAACAA GGAGGCGCTG GGATATCAAA CTTGCAAAAA AAAAAAAAAA AAAAAAAAAA 3600

CAAAAAAACT AGACAAGGAG AGAGGCACTA GAACTCCAGC TGGAAGTCAC GGAGTGGCTC 3660

GAGCAGCCTT GGGAAGAGGC AAGGAGCTTC TGAAGAAACT GCCTCTGCAC ACACATCACT 3720

GGCTGTGACC CCTCAGGCTA GCCCTTCTCC ACTCTGGGGG AGGAGGTGGG AAGGGCCACC 3780

AGGCCCCCAG CTGCCAGGCC AGCTGACCCC AGCCTTCCTG GAACAGGGAG TCTGCAGGAG 3840

CGCAGACAGG CACAGCCCTG GAGCAGGCAG GCCGAGGGCT GCGGCACTGG AGCAGGCTGA 3900

CTTACATGCT CCACATGGGA CCTGTGTCAC CCAATGAGAT GTTTGTTACT CTGGTAAATG 3960 

CCACACGTTA ACACAATAAC ACCCATTCCT GGGACCGTGG GGATTTAGGG CACGTCACTG 4020 

CAGACACGCT CTGCAGCATT CACCGACAGT CTGTCATGCA CCCACCACGT TGGCCATGTC 4080 

CTTGTGTTCC TATCGGATGC TCCCAGTAAC CAGGGGGACC ACCCGAGCTA ATCATGGAAT 4140 

GTCTGTTCCC AGCAAACACG ATAAAGAAAG ATTGTGCACT TTAACCTCTC TCATCAGGGC 4200 

CCAAGGGCTG GCTGGGATTT TTTTTTTTTT TTTCCCACTA ACTTTGTTTC TGACCAAAGT 4260 

GAATTGGAGG CACTCTGCTA AAAGACATCC CCGTAGACAT AGGGGAGAGA GTTGCTGGCT 4320 

GAGGGCTTCC CTTGGCTTCC AGAAGGCAGC CTTCCATCCA GACAAGCCAG TGAGCTCTCC 4380 

CCTTGGGATC ACTGGGGTGA TCAGTCAGCA GATTGATTCT CATTCATAAG ATCATTCCTC 4440 

CCTTTAAATT GAGCCCCTAA GAGCACTGGC CTGGGAGTCA GACAGACCTG GGTTCAAGTC 4500 

CTCAGTCCCC TGCCCACTCC CTACGTGACT TTGATCAGGT CACTAGTGTC TCTCTGAGCC 4560 

TCAGTTTCCC CTCTGTAACT TGGGGTTGAA CTAAAACACC TGTCCTGCCT ACCTCACAAG 4620 

GTCACTCTGA GGATTGAAAC TTGATCTTGT CCAGGAAAGC TTTGTACCAA ACAGTGAAGC 4680 

CGCCCTGATC CGTGAGGTAT GAGTATGACT CTGACCTTCA GCCCTCCCTA CAGCCGGGGG 4740 

TGTGGCCCAG AGAAGCTTCC AGCACAGCCC TCTACCCAGA ACATCCGGGC TGGAGGGAGG 4800

CTCCCAGTGA CTTTTCTGAC ATTCCTAGAC AGGTTCATTC TTTGCTCAAG AAAGGCCTGA 4860

ATGACAATGT CCAGGATGTC TGCACAACTG AGCAGCTCGC TCACTCCCTA AAGAAACCTA 4920 

TTGGCAGCTT CAACAGGCAG GCAATAATCT CTTCCCAGAA CCACTGCAGT CAGGAATAAA 4980 

CTGTTTTCTC CACCAGGCTT TGACAAAAGG GCCCACAGGA ATCTTACCAA TGCCAACATT 5040 

TCAAAGCACC CTATTTCACG TAGCATAGCT TTCTGCTCCC CTTCCCCAAA GAGAGGTTAT 5100 

GGAGGTACTG TAGCTTTTAG GGAAAAAAAA ATGTTAACAC ATCACAGGTC AAGTTGAAGT 5160 

CATTCTCTGT TTAGGCACTA AAAATCGGTG TTGTCACTCA CTGTGTATTA CCAGTATTTA 5220 

CTTGCTTTCT TGATTTCACC AAAACCAAAT TTAATTTAAA GGACCACATT AATTTTTCAA 5280 

AGGGAAAGAG ACAATTAATT GTACATAATG TATACACACA CACAAAAAAA AATACCTGTA 5340 

GAAATATTAT TCCAGCATAG CAGGAAAACA AACAAAAGTA TTGGACTGTC GGAGGTGAGC 5400 

CTGTGCGTCT GTAACCCTTT GTGACTCCTG AGCGTGCGCT GTCTTCTAGG TTAACTCACG 5460

AAGTACATTC TCTGTCTTAC TGATACTGTA GGTTCACCCA TTTTTTTTTA ATTTCCTCGC 5520 

AAATAACAAG ACCCACAGAA GTGACTCTAG CTACTTAATG GTTCTGTTCT TTTATATGCA 5580 

GCAAACACAC CGTCCATTTC TGAAGAGGCT TCGGCCTGAA GGCATTTTCC AATGATGTTA 5640 

GTGCACAAAA CGCTTTAAAT TAGACTGGAA CTGCCAGAAT CAAATGTAAA TGAGGAATTT 5700

CTCGTACCCC TACTGCATGG TATCGATTTT TAATAAATTG TTGCAAATTT GTTTTTATGA 5760 
ATAAAAGGAA AAAACCTGTC GTCTTT 

Seq ID NO: 12 Protein sequence:
Protein Accession #: EOS sequence

1 11 21 31 41 51

| | | | | |

MASPRSSGQP GPPPPPPPPP ARLLLLLLLP LLLPLAPGAW GWARGAPRPP PSSPPLSIMG 60

LMPLTKEVAK GSIGRGVLPA VELAIEQIRN ESLLRPYFLD LRLYDTECDN AKGLKAFYDA 120

IKYGPNHLMV FGGVCPSVTS IIAESLQGWN LVQLSFAATT PVLADKKKYP YFFRTVPSDN 180

AVNPAILKLL KHYQWKRVGT LTQDVQRFSE VRNDLTGVLY GEDIEISDTE SFSNDPCTSV 240

KKLKGNDVRI ILGQFDQNMA AKVFCCAYEE NMYGSKYQWI IPGWYEPSWW EQVHTEANSS 300

RCLRKNLLAA MEGYIGVDFE PLSSKQIKTI SGKTPQQYER EYNNKRSGVG PSKFHGYAYD 360

GIWVIAKTLQ RAMETLHASS RHQRIQDFNY TDHTLGRIIL NAMNETNFFG VTGQVVFRNG 420

ERMGTIKFTQ FQDSREVKVG EYNAVADTLE IINDTIRFQG SEPPKDKTII LEQLRKISLP 480

LYSILSALTI LGMIMASAFL FFNIKNRNQK LIKMSSPYMN NLIILGGMLS YASIFLFGLD 540

GSFVSEKTFE TLCTVRTWIL TVGYTTAFGA MFAKTWRVHA IFKNVKMKKK IIKDQKLLVI 600

VGGMLLIDLC ILICWQAVDP LRRTVEKYSM EPDPAGRDIS IRPLLEHCEN THMTIWLGIV 660

YAYKGLLMLF GCFLAWETRN VSIPALNDSK YIGMSVYNVG IMCIIGAAVS FLTRDQPNVQ 720

FCIVALVIIF CSTITLCLVF VPKLITLRTN PDAATQNRRF QFTQNQKKED SKTSTSVTSV 780

NQASTSRLEG LQSENHRLRM KITELDKDLE EVTMQLQDTP EKTTYIKQNH YQELNDILNL 840

GNFTESTDGG KAILKNHLDQ NPQLQWNTTE PSRTCKDPIE DINSPEHIQR RLSLQLPILH 900
HAYLPSIGGV DASCVSPCVS PTASPRHRHV PPSFRVMVSG L

Seq ID NO: 13 DNA sequence 

Nucleic Acid Accession #: NM_001565.1

Coding sequence: 67-363

1 11 21 31 41 51

| | | | | |

GAGACATTCC TCAATTGCTT AGACATATTC TGAGCCTACA GCAGAGGAAC CTCCAGTCTC 60 

AGCACCATGA ATCAAACTGC GATTCTGATT TGCTGCCTTA TCTTTCTGAC TCTAAGTGGC 120 

ATTCAAGGAG TACCTCTCTC TAGAACCGTA CGCTGTACCT GCATCAGCAT TAGTAATCAA 180 

CCTGTTAATC CAAGGTCTTT AGAAAAACTT GAAATTATTC CTGCAAGCCA ATTTTGTCCA 240 

CGTGTTGAGA TCATTGCTAC AATGAAAAAG AAGGGTGAGA AGAGATGTCT GAATCCAGAA 300 

TCGAAGGCCA TCAAGAATTT ACTGAAAGCA GTTAGCAAGG AAATGTCTAA AAGATCTCCT 360 

TAAAACCAGA GGGGAGCAAA ATCGATGCAG TGCTTCCAAG GATGGACCAC ACAGAGGCTG 420 

CCTCTCCCAT CACTTCCCTA CATGGAGTAT ATGTCAAGCC ATAATTGTTC TTAGTTTGCA 480 

GTTACACTAA AAGGTGACCA ATGATGGTCA CCAAATCAGC TGCTACTACT CCTGTAGGAA 540 

GGTTAATGTT CATCATCCTA AGCTATTCAG TAATAACTCT ACCCTGGCAC TATAATGTAA 600 

GCTCTACTGA GGTGCTATGT TCTTAGTGGA TGTTCTGACC CTGCTTCAAA TATTTCCCTC 660 

ACCTTTCCCA TCTTCCAAGG GTACTAAGGA ATCTTTCTGC TTTGGGGTTT ATCAGAATTC 720 

TCAGAATCTC AAATAACTAA AAGGTATGCA ATCAAATCTG CTTTTTAAAG AATGCTCTTT 780 

ACTTCATGGA CTTCCACTGC CATCCTCCCA AGGGGCCCAA ATTCTTTCAG TGGCTACCTA 840 

CATACAATTC CAAACACATA CAGGAAGGTA GAAATATCTG AAAATGTATG TGTAAGTATT 900 

CTTATTTAAT GAAAGACTGT ACAAAGTATA AGTCTTAGAT GTATATATTT CCTATATTGT 960

TTTCAGTGTA CATGGAATAA CATGTAATTA AGTACTATGT ATCAATGAGT AACAGGAAAA 1020

TTTTAAAAAT ACAGATAGAT ATATGCTCTG CATGTTACAT AAGATAAATG TGCTGAATGG 1080

TTTTCAAATA AAAATGAGGT ACTCTCCTGG AAATATTAAG

Seq ID NO: 14 Protein sequence:
Protein Accession #: NP_001556.1

1 11 21 31 41 51

| | | | | |

MNQTAILICC LIFLTLSGIQ GVPLSRTVRC TCISISNQPV NPRSLEKLEI IPASQFCPRV 60

EIIATMKKKG EKRCLNPESK AIKNLLKAVS KEMSKRSP

Seq ID NO: 15 DNA sequence 

Nucleic Acid Accession #: Eos sequence 

Coding sequence: 241..1272

1 11 21 31 41 51
| | | | | |

GCGCAGACGC CCCAGCCCCC CACCGCCCCC AAAGGGGCGA GCGACGCCAA GCTCTGCGCT 
CTCTACAAAG AGGCCGAGCT GCGCCTGAAG GGCAGCAGCA ACACCACGGA GTGTGTTCCC 
GTGCCCACCT CCGAGCACGT GGCCGAGATC GTGGGCAGGC AAGGCTGCAA GATTAAGGCC 
TTGAGGGCCA AGACCAACAC CTACATCAAG ACACCGGTGA GGGGCGAGGA ACCAGTGTTC 
ATGGTGACAG GGCGACGGGA GGACGTGGCC ACAGCCCGGC GGGAAATCAT CTCAGCAGCG 
GAGCACTTCT CCATGATCCG TGCCTCCCGC AACAAGTCAG GCGCCGCCTT TGGTGTGGCT 
CCTGCTCTGC CCGGCCAGGT GACCATCCGT GTGCGGGTGC CCTACCGCGT GGTGGGGCTG 
GTGGTGGGCC CCAAAGGGGC AACCATCAAG CGCATCCAGC AGCAAACCAA GACATACATT 
ATCACACCAA GCCGTGACCG CGACCCCGTG TTCGAGATCA CGGGTGCCCC AGGCAACGTG 
GAGCGTGCGC GCGAGGAGAT CGAGACGCAC ATCGCGGTGC GCACTGGCAA GATCCTCGAG 
TACAACAATG AAAACGACTT CCTGGCGGGG AGCCCCGACG CAGCAATCGA TAGCCGCTAC 
TCCGACGCCT GGCGGGTGCA CCAGCCCGGC TGCAAGCCCC TCTCCACCTT CCGGCAGAAC 
AGCCTGGGCT GCATCGGCGA GTGCGGAGTG GACTCTGGCT TTGAGGCCCC ACGCCTGGGT 
GAGCAGGGCG GGGACTTTGG CTACGGCGGG TACCTCTTTC CGGGCTATGG CGTGGGCAAG
CAGGATGTGT ACTACGGCGT GGCCGAGACT AGCCCCCCGC TGTGGGCGGG CCAGGAGAAC 
GCCACGCCCA CCTCCGTGCT CTTCTCCTCT GCCTCCTCCT CCTCCTCCTC TTCCGCCAAG 
GCCCGCGCTG GGCCCCCGGG CGCACACCGC TCCCCTGCCA CTTCCGCGGG ACCCGAGCTG 
GCCGGACTCC CGAGGCGCCC CCCGGGAGAG CCGCTCCAGG GCTTCTCTAA ACTTGGTGGG
GGCGGCCTGC GGAGCCCCGG CGGCGGGCGG GATTGCATGG TCTGCTTTGA GAGCGAAGTG 
ACTGCCGCCC TTGTGCCCTG CGGACACAAC CTGTTCTGCA TGGAGTGTGC AGTACGCATC 
TGCGAGAGGA CGGACCCAGA GTGTCCCGTC TGCCACATCA CAGCCACGCA AGCCATCCGA 
ATATTCTCCT AAGCCCCGTG CCCCATGCCT CCGGGGCCCA CTCCACTGGG CCCACCCTGG 
ACCTGTTTTC CACTAAAGCC TTTTGGAAAG CGGTGATTTG AGGGGCAAGG TGCTTAGAGA 
TACTCGCTCG CTGGGGAAGG GGGGAGGGAG GCAGTGGTGG CTGGAGGGTG CGCCACTTTC 
AGAGCCTCTG GTCACCCTGT CCTGGAAAGA TTGGGAGGGG GCCAGACTGA AAATTTTACT 
AGAGTTACAA CTCTGATACC TCAACACACC CTTAAATCTG GAAGCAGCTA AGAGAAACTT 
TTGTTTTGCC AGAGGTGGCC ACTAAGGCAT TCTGACGCCC TCTGCCCACC TCCCCCGCTG 
TGTGTCACTC CACCCCTTCT TCCGAGGAGG GGGTGGGTAA AAGGGAGAGG GAGAATTACC 
ACCTGTATCT AGAGGTGCTC TTTGCAATCC CTAAGCCCTC TGGTCCTGAC CTCCGACCTC 
CTAACATGAC CCTTTACCTC CCACCCCACC CCCATATCCT GTTTGGGAAA CTGTCACCAG 
TTTCCAGCAG TGTAAGGGAG TTGGAGTCCT ATCAGAAGTT GCATAGATCT TCTAGGGGTT 
GGGGAGAGAA GCATGTCAAT CGTTTCTGTG GCTGAAAGGC TCAGAAGCCA TCTGTCCCCA 
CAAAGCTGGG CTAGAGGAAT CTGGAGAGGA GTCCTCCTCT CTGCCCCTGT CCCCTGCAGT 
GTTTCCCTTC ACTCTCTCCG CCTATCTTCC CTTCCTTTGG GATCTTCCCT TTCCTCAACT 
CTTTCCTTTC CCTCCAGCTC TTTGCTTTGC TTTCTTTTGG TGGCTGTCAC TCCCAGCTCT 
GTCTTGTTCC TTGTCTTTGT CTTTCTTCCC TTCCCCCTGC CCCTGCCCCT ACCAGCCCAG 
CTTTGGGGAC ACCATCCTTC TGGGGAGAAG TAGGGGGAGG AATATTTGGA TGGTCCCTCC 
ATTCCTCTTC AGGCATCTGG AGGCCCTCTC CCCCACTCCT CCAAAGAAAC ATCTCAAATT 
ATTGATGGAA TGTATCCCCA TTCTCAGTGA AAATGTGAGG AGGGGACTAA TACTGGGGTA 
AAGGGTCAAA CCCCCACCTT CATCACTATG GGCATTATAT TTAGGGAGTA GTTCTTGGGC 
TGGATTTTCT GGTTGTGGAA GTGGGGGCGC CAGAGTAGTG TGTCTGCTAT TTAAAGGAGC 
AGGAAAGGGC GTGAGGCAGG AGGAGAGACT GGTGGAGGGA AGAGCTGCTC CTCCCATGCA 
GTGCCCGACT CCCTGCACCC CTCTCAACCT GACCTGAACC TTTATTGAAT CCTTATTAGC 
TTGAATCCTT ATTAGCTTGA ATCCTCCATG CAAATCATGG AGTCTGTGTC CCACCTGATG 
TGGTTGAGGA GAAGCCAGGT CTTCAAAGAG GGGTCAGCCT GGGGCAAAGC AGGACTGGGG 
GGAGGTGGGC AGCAGGGCCT ATTCTGAGAA TCACATATTG TTACAGGCCT TGCACCCCCT 
TTGCTGCTTC CCTCCTGCTC ATTTGGGGCT GCCACCAGCT CTCCACCCTC CTGGTTCCGC 
TGGCCGGGCC AAGAGAGGAT GGAGGGATGG GAGTCCCAGG AGATCCTTGT AAATAGTGGG 
GTGGGACTGT TCTGAGTGAT CACCCGAGCA CTTAAAGCTC CAGAGTCCCA "CTTCCTGG 
ATGGAGCAGG TGGAGGTGCA GAGGGGATTT CCTCCTCTCC TTCCTCCTGT CGAGAATTAA 
CACCTCTCCA CAGCCTTCCC CTCCAGAACA CCAGCCAGGG AGGGGTGGGG AAGGAGGTCA 
CAGCCAAGAA AACTGCCCTG TGACGACTTC CCTCCTTCCC GCCTATGTGA GCCATCCTGA 
GATGTCTGTA CAATAGAAAC CAAACCAAAT GGGCACCCTC GGTTGCCGGG GGGCAGGTGG 
GGAGGGGGGT GGGAAGAAGG GATGTCTGTC TGTCGTCCCC CTCCCCCTCT CCACTCTTTA 
CCCACAAAGG CAGAAGACTG TTACACTAGG GGGCTCAGCA AATTCAATCC CACCCTTACC 
AATTGAGCCA AACCTAGAAA CAAACACAAA ACACGAATAG TGAGAGACAA AATAGAGGAG 
AGAAAGAGAG CATGAGAGGG AGCGAGACAG GCGACCAACA CAGAGGAGAG AAAACAAAAA 
TAGCAAAAAA AAAAAAAAAA AGCAGTTCTT TATAATTTAA TATTCTATTT TAATAAAGGC 
GTTTATTACC ATATAAATGT AGCAAAGAAC CTGGGCTAAT ATGAAAAAAA AAGACTTTTT 
ATTAGGTAAT TTATTATATG AAAAGGATAT TTTATTTTAT GATAAAGTGA TCCTTAAAAA
AATAAAAAAA
TTGTATTCAG 
TTGCTTTTTT 
ATCCCGGGGG 
CCAGTAGATT 
AGCCGGCAGA 
GACTGAGGAT 
GGAGGGGGAA 
ATTTAAGAGA 
AACCGAGGGG 
AAATTTATTT 
ATATTAAGAT 
AAAGTTGTAA 
AATCTTCAGA 
CTGGGGTTGA 
CATTTCCCAG 
TGGGGGCAGC 
ACTCCAATGT 
CCCCAGGCCT 
TTGATTTAAG 
CCTTCAGAGC 
TCCACTTAGC 
AAAGCTGACT 
CCAAGAAGGA 
GCATCCCCAT 
GACCCTCTGC 
CCTGGCCTGG 
GAGAAATCAA 
CTTCCCCTCC 
CTGCTTCATT 
CTCCCTCTAC 
TGTCCTAGAA 
CCCCACTCGA 
GGATGGTCTC 
CTAGTGAAGA 
GAAATAAAAC 
CCAAGAAGGG 
TACTATTACT 



CTTTAGAAGG 
AGTTAAATCT 
CCCCCACTTT 
TGAATGGTGG 
GGATTTCACT 
GCCAAACAAA 
CCTGGGGTGT 
TCCAGAATTA 
TCTAGATTTT 
TTTAAAAGAA 
CATATAATCC 
AGCTTAAAGT 
GGAAAAGATA 
TACCAGGGTT 
TTCTGAGAAT 
GGTCTTGGAA 
AGAGGAGGCC 
GGTATGTCCC 
AACTGAGGAA 
AGCCTGGGAA 
TTCAAGAGCA 
TTTGACCTCC 
CGTTCTTCAC 
CTTCCCACGG 
GGGCACAGAG 
TCCAGTTCAC 
GAACACATTG 
CATTTTTAAA 
TATCCAAGGG 
ACACCAATGA 
CCAATTTACT 
ACCCCGCTAG 
GGGTGCGGGT 
TCCACTGTAG 
CACGTACAAA 
AGACCCAAGA 
GGAGGGTTAA 
TAAGGCTC 



TTTAGAATAT 
TAAAAAAAAA 
TTTTTTGGGG 
AGAGAGGAGC 
CCATGGACTC 
GAAAGGGATT 
CCCCCACCAC 
AGGCCTAGCA 
TTTTTAACCA 
AAGAATACTA 
TAGAGAGAGA 
TTAGTAGCAT 
AATAGAGGAG 
GGATCAGGTT 
CCTTGGATTT 
AGGATGCACA 
AAGCACATTC 
CTCCTGGGGC 
GTCCTTCTTG 
GTCATTCCAC 
CTGAGGCGAT 
ATGGCAGCAG 
CTACTTGCAA 
AGTGGAGTGG 
GCTGGGGAAA 
CTCCATCTAT 
CCCTCCCCAC 
GCACTTGCCT 
GTTGGAGGGG 
CTCTGCCATC 
CTTCTCGCCC 
TTTGGGATGG 
GGGGAAGGGG 
AAAGTAGAGT 
TCCACCAAGG 
ATAATCAAGT 
TGTATTAAAT 



ATGTAGGGAG 
AAGTGTTTTT 
AGGAATGTCA 
TGGGGGAACC 
CTCCTCCCCT 
AACAAGAAAG 
TTTCCCCTGC 
GGCCTATAGG 
AAAACAAGAG 
CAAAATAATA 
AAGAAACAAT 
TGAGGGCCCC 
GGAAGTGGCT 
GCTAGTTTAA 
TTAAATTGTA 
CTGATCATCT 
ACCTGCACCC 
TCCCAGCTCA 
AAGTGTGACC 
AAACAGACAC 
CAGTCCCCTA 
TAGCAGTAAC 
AGTGCTTTCT 
AAACACTGTC 
GGCACAGGGA 
ATGTGTTCAG 
ACAAAACTGG 
TCTACCAACC 
CCCCTTGGCT 
CCCCTCCCTG 
CACCTAGGGA 
TAGCGTCTGG 
GGGTGGGTGG 
AGGATTGTGG 
AAAAAGATTT 
CAAAGTGATG 
GTGCTATTAA 



AGAAGAAGAA 
AATATATGTT 
TTTGCTTTTC 
CGGTCCCTCC 
CTCCCCCTCC 
GAAGAAGCTG 
CCTGTOGCAG 
AACCCTCAGA 
AGAAAGAGAA 
ATTATTAATA 
TACTAGTTAC 
TGGGTCCAGT 
GAGTCCACCC 
GATTGGGAGC 
GGACAAAGAA 
CAATAAGACA 
CTAGTACCTG 
AACCCTCCCA 
TCGGTCCACT 
ACATGCACAC 
CCCCTGTTCC 
AATCTCAGTA 
TGTCTCATAA 
CTTGAAGGCC 
CTTTGGGTGA 
GTAGGGGTCA 
AGGGCTTGGC 
CCAGCTTGCA 
CTCCTTTTGG 
GCCCTAGACC 
CAGATTCCCC 
GGTGGGGAGG 
AGACAGCCCT 
TCAGACTTAA 
CAAAAGCAAA 
TTGCACAAAA 
GAACTTAATT 



AAAAATACAT 
TGGGTTTACG 
TTGGGGGAGC 
TGGGACCCTT 
CCCTCAGGGG 
TAGGACTAAG 
GGGCAAGTGA 
GATGTGTGAG 
GAAAAAGAGA 
ATAATAATTC 
TTAGTAGACA 
AGAATGTATA 
TGAGTTGCCC 
TTCCAGTCTG 
ATGAGGGGTT 
GGGGCTGGGT 
GGCAGCCCAT 
TGCCTGCTTC 
TCTCTACAGA 
ACGCTTCTCA 
CATCCAGCTT 
ATTGTTCTTT 
AAGTTAGATT 
TGGGAGAAAG 
CCCTAACCCT 
TCTACTGTAC 

ATCACTGGGC 
CAGGAGGAGC 
CCAAACACAT 
CTGCTCTTTT 
GCTTCCCCTT 
GGGGCAGGGA 
TTTGAGGCAT 
ATAAAAGCGG 
TGCAGAGAAA 
TTATTAAAAG

Seq ID NO: 16 Protein sequence:
Protein Accession #: XP_044166

1 11 21 31 41 51

| | | | | |

MVTGRREDVA TARREIISAA EHFSMIRASR NKSGAAFGVA PALPGQVTIR VRVPYRVVGL 60

VVGPKGATIK RIQQQTNTYI ITPSRDRDPV FEITGAPGNV ERAREEIETH IAVRTGKILE 120

YNNENDFLAG SPDAAIDSRY SDAWRVHQPG CKPLSTFRQN SLGCIGECGV DSGFEAPRLG 180

EQGGDFGYGG YLFPGYGVGK QDVYYGVAET SPPLWAGQEN ATPTSVLFSS ASSSSSSSAK 240

ARAGPPGAHR SPATSAGPEL AGLPRRPPGE PLQGFSKLGG GGLRSPGGGR DCMVCFESEV 300

TAALVPCGHN LFCMECAVRI CERTDPECPV CHITATQAIR IFS 343

Seq ID NO: 17 DNA sequence
Nucleic Acid Accession #: NM_005940
Coding sequence: 23-1489

1 11 21 31 41 51
| | | | | |
AAGCCCAGCA GCCCCGGGGC GGATGGCTCC GGCCGCCTGG CTCCGCAGCG CGGCCGCGCG 60
CGCCCTCCTG CCCCCGATGC TGCTGCTGCT GCTCCAGCCG CCGCCGCTGC TGGCCCGGGC 120
TCTGCCGCCG GACGTCCACC ACCTCCATGC CGAGAGGAGG GGGCCACAGC CCTGGCATGC 180
AGCCCTGCCC AGTAGCCCGG CACCTGCCCC TGCCACGCAG GAAGCCCCCC GGCCTGCCAG 240
CAGCCTCAGG CCTCCCCGCT GTGGCGTGCC CGACCCATCT GATGGGCTGA GTGCCCGCAA 300
CCGACAGAAG AGGTTCGTGC TTTCTGGCGG GCGCTGGGAG AAGACGGACC TCACCTACAG 360
GATCCTTCGG TTCCCATGGC AGTTGGTGCA GGAGCAGGTG CGGCAGACGA TGGCAGAGGC 420
CCTAAAGGTA TGGAGCGATG TGACGCCACT CACCTTTACT GAGGTGCACG AGGGCCGTGC 480
TGACATCATG ATCGACTTCG CCAGGTACTG GCATGGGGAC GACCTGCCGT TTGATGGGCC 540
TGGGGGCATC CTGGCCCATG CCTTCTTCCC CAAGACTCAC CGAGAAGGGG ATGTCCACTT 600
CGACTATGAT GAGACCTGGA CTATCGGGGA TGACCAGGGC ACAGACCTGC TGCAGGTGGC 660
AGCCCATGAA TTTGGCCACG TGCTGGGGCT GCAGCACACA ACAGCAGCCA AGGCCCTGAT 720
GTCCGCCTTC TACACCTTTC GCTACCCACT GAGTCTCAGC CCAGATGACT GCAGGGGCGT 780
TCAACACCTA TATGGCCAGC CCTGGCCCAC TGTCACCTCC AGGACCCCAG CCCTGGGCCC 840
CCAGGCTGGG ATAGACACCA ATGAGATTGC ACCGCTGGAG CCAGACGCCC CGCCAGATGC 900
CTGTGAGGCC TCCTTTGACG CGGTCTCCAC CATCCGAGGC GAGCTCTTTT TCTTCAAAGC 960
GGGCTTTGTG TGGCGCCTCC GTGGGGGCCA GCTGCAGCCC GGCTACCCAG CATTGGCCTC 1020
TCGCCACTGG CAGGGACTGC CCAGCCCTGT GGACGCTGCC TTCGAGGATG CCCAGGGCCA 1080
CATTTGGTTC TTCCAAGGTG CTCAGTACTG GGTGTACGAC GGTGAAAAGC CAGTCCTGGG 1140
CCCCGCACCC CTCACCGAGC TGGGCCTGGT GAGGTTCCCG GTCCATGCTG CCTTGGTCTG 1200
GGGTCCCGAG AAGAACAAGA TCTACTTCTT CCGAGGCAGG GACTACTGGC GTTTCCACCC 1260
CAGCACCCGG CGTGTAGACA GTCCCGTGCC CCGCAGGGCC ACTGACTGGA GAGGGGTGCC 1320
CTCTGAGATC GACGCTGCCT TCCAGGATGC TGATGGCTAT GCCTACTTCC TGCGCGGCCG 1380
CCTCTACTGG AAGTTTGACC CTGTGAAGGT GAAGGCTCTG GAAGGCTTCC CCCGTCTCGT 1440
GGGTCCTGAC TTCTTTGGCT GTGCCGAGCC TGCCAACACT TTCCTCTGAC CATGGCTTGG 1500
ATGCCCTCAG GGGTGCTGAC CCCTGCCAGG CCACGAATAT CAGGCTAGAG ACCCATGGCC 1560
ATCTTTGTGG CTGTGGGCAC CAGGCATGGG ACTGAGCCCA TGTCTCCTGC AGGGGGATGG 1620
GGTGGGGTAC AACCACCATG ACAACTGCCG GGAGGGCCAC GCAGGTCGTG GTCACCTGCC 1680
AGCGACTGTC TCAGACTGGG CAGGGAGGCT TTGGCATGAC TTAAGAGGAA GGGCAGTCTT 1740
GGGACCCGCT ATGCAGGTCC TGGCAAACCT GGCTGCCCTG TCTCATCCCT GTCCCTCAGG 1800
GTAGCACCAT GGCAGGACTG GGGGAACTGG AGTGTCCTTG CTGTATCCCT GTTGTGAGGT 1860
TCCTTCCAGG GGCTGGCACT GAAGCAAGGG TGCTGGGGCC CCATGGCCTT CAGCCCTGGC 1920
TGAGCAACTG GGCTGTAGGG CAGGGCCACT TCCTGAGGTC AGGTCTTGGT AGGTGCCTGC 1980
ATCTGTCTGC CTTCTGGCTG ACAATCCTGG AAATCTGTTC TCCAGAATCC AGGCCAAAAA 2040
GTTCACAGTC AAATGGGGAG GGGTATTCTT CATGCAGGAG ACCCCAGGCC CTGGAGGCTG 2100
CAACATACCT CAATCCTGTC CCAGGCCGGA TCCTCCTGAA GCCCTTTTCG CAGCACTGCT 2160
ATCCTCCAAA GCCATTGTAA ATGTGTGTAC AGTGTGTATA AACCTTCTTC TTCTTTTTTT 2220
TTTTTAAACT GAGGATTGTC ATTAAACACA GTTGTTTTCT


Seq ID NO: 18 Protein sequence: 
Protein Accession #: NP_005931.1

1 11 21 31 41 51
| | | | | |
MAPAAWLRSA AARALLPPML LLLLQPPPLL ARALPPDVHH LHAERRGPQP WHAALPSSPA 60
PAPATQEAPR PASSLRPPRC GVPDPSDGLS ARNRQKRFVL SGGRWEKTDL TYRILRFPWQ 120
LVQEQVRQTM AEALKVWSDV TPLTFTEVHE GRADIMIDFA RYWHGDDLPF DGPGGILAHA 180
FFPKTHREGD VHFDYDETWT IGDDQGTDLL QVAAHEFGHV LGLQHTTAAK ALMSAFYTFR 240
YPLSLSPDDC RGVQHLYGQP WPTVTSRTPA LGPQAGIDTN EIAPLEPDAP PDACEASFDA 300
VSTIRGELFF FKAGFVWRLR GGQLQPGYPA LASRHWQGLP SPVDAAFEDA QGHIWFFQGA 360
QYWVYDGEKP VLGPAPLTEL GLVRFPVHAA LVWGPEKNKI YFFRGRDYWR FHPSTRRVDS 420
PVPRRATDWR GVPSEIDAAF QDADGYAYFL RGRLYWKFDP VKVKALEGFP RLVGPDFFGC 480
AEPANTFL



Seq ID NO: 19 DNA sequence 

Nucleic Acid Accession #: NM_005794.1

Coding sequence: 434-1276

1 11 21 31 41 51
| | | | | |
GGTTCCCTTC CACGCTGTGA AGCTTTGTTC TTTTGGTCTT CATGATAAAT CTTGCTGCTG 60
CTCACTCGTT GGGTCCGTGC CACCTTTAAG AGCTGTAACA CTCACCGCGA AGGTCTGCAA 120
CTTCACTCCT GGGGCCAGCA AGACCACGAA TGCACCGAGA GGAATGAACA ACTCTGGACA 180
CACCATCTTT AAGAACCGTA ATACTCACCG CAAGGGTCTG CAACTTCATT CTTGAAGTCA 240
GTGAGGCCAA GAACCCATCA ATTCCGTACA CATTTTGGTG ACTTTGAAGA GACTGTCACC 300
TATCACCAAG TGGTGAGACT ATTGCCAAGC AGTGAGACTA TTGCCAAGTG GTGAGACCAT 360
CACCAAGCGG TGAGACTATC ACCTATCGCC AAGTGGCCTG ATTCAGCAGG AAGCATCTCA 420
GACACCAACC ACTATGCTGT CAGCAGTTGC CCGGGGCTAC CAGGGCTGGT TTCATCCCTG 480
TGCTAGGCTT TCTGTGAGGA TGAGCAGCAC CGGGATAGAC AGGAAGGGCG TCCTGGCTAA 540
CCGGGTAGCC GTGGTCACGG GGTCCACCAG TGGGATCGGC TTTGCCATCG CCCGACGTCT 600
GGCCCGGGAC GGGGCCCACG TGGTCATCAG CAGCCGGAAG CAGCAGAACG TGGACCGGGC 660
CATGGCCAAG CTGCAGGGGG AGGGGCTGAG TGTGGCGGGC ATTGTGTGCC ACGTGGGGAA 720
GGCTGAGGAC CGGGAGCAGC TGGTGGCCAA GGCCCTGGAG CACTGTGGGG GCGTCGACTT 780
CCTGGTGTGC AGCGCAGGGG TCAACCCTCT GGTAGGGAGC ACTCTGGGGA CCAGTGAGCA 840
GATCTGGGAC AAGATCCTAA GTGTGAACGT GAAGTCCCCA GCCCTGCTGC TGAGCCAGTT 900
GCTGCCCTAC ATGGAGAACA GGAGGGGTGC TGTCATCCTG GTCTCTTCCA TTGCAGCTTA 960
TAATCCAGTA GTGGCGCTGG GTGTCTACAA TGTCAGCAAG ACAGCGCTGC TGGGTCTCAC 1020
TAGAACACTG GCATTGGAGC TGGCCCCCAA GGACATCCGG GTAAACTGCG TGGTTCCAGG 1080
AATTATAAAA ACTGACTTCA GCAAAGTGTT TCATGGGAAT GAGTCTCTCT GGAAGAACTT 1140
CAAGGAACAT CATCAGCTGC AGAGGATTGG GGAGTCAGAG GACTGTGCAG GAATCGTGTC 1200
CTTCCTGTGC TCTCCAGATG CCAGCTACGT CAACGGGGAG AACATTGCGG TGGCAGGCTA 1260
CTCCACTCGG CTCTGAGAGG AGTGGGGGCG GCTGCGTAGC TGTGGTCCCA GCCCAGGAGC 1320
CTGAGGGGGT GTCTAGGTGA TCATTTGGAT CTGGAGCAGA GTCTGCCATT CTGCCAGACT 1380
AGCAATTTGG GGGCTTACTC ATGCTAGGCT TGAGGAAGAA GAAAAACGCT TCGGCATTCT 1440
CC

Seq ID NO: 20 Protein sequence:
Protein Accession #: NP_005765.1

1 11 21 31 41 51
| | | | | |
MLSAVARGYQ GWFHPCARLS VRMSSTGIDR KGVLANRVAV VTGSTSGIGF AIARRLARDG 60
AHVVISSRKQ QNVDRAMAKL QGEGLSVAGI VCHVGKAEDR EQLVAKALEH CGGVDFLVCS 120
AGVNPLVGST LGTSEQIWDK ILSVNVKSPA LLLSQLLPYM ENRRGAVILV SSIAAYNPVV 180
ALGVYNVSKT ALLGLTRTLA LELAPKDIRV NCVVPGIIKT DFSKVFHGNE SLWKNFKEHH 240
QLQRIGESED CAGIVSFLCS PDASYVNGEN IAVAGYSTRL



Seq ID NO: 21 DNA sequence
Nucleic Acid Accession #: Eos sequence
Coding sequence: 77-904

1 11 21 31 41 51

| | | | | |

GCCGCGTTTT GCCTCCGCAG CAGCTCTGGG CTCTTCTCAG CTCCGCGAGC AGCTGCTCCA 60

ATGCCCCGGA GTGGCCATGG GCGCCCCGCA CTGGTGGGAC CAGCTGCAGG CTGGTAGCTC 120

GGAGGTGGAC TGGTGCGAGG ACAACTACAC CATCGTGCCT GCTATCGCCG AGTTCTACAA 180

CACGATCAGC AATGTCTTAT TTTTCATTTT ACCGCCCATC TGCATGTGCT TGTTTCGTCA 240

GTATGCAACA TGCTTCAACA GTGGCATCTA CTTAATCTGG ACTCTTTTGG TTGTAGTGGG 300

AATTGGATCC GTCTACTTCC ATGCAACCCT TAGTTTCTTG GGTCAGATGC TTGATGAACT 360

TGCAGTCCTT TGGGTTCTGA TGTGTGCTTT GGCCATGTGG TTCCCCAGAA GGTATCTACC 420

AAAGATCTTT CGGAATGACC GGGGTAGGTT CAAGGTGGTG GTCAGTGTCC TGTCTGCGGT 480 

TACGACGTGC CTGGCATTTG TCAAGCCTGC CATCAACAAC ATCTCTCTGA TGACCCTGGG 540 

AGTTCCTTGC ACTGCACTGC TCATCGCAGA GCTAAAGAGG TGTGACAACA TGCGTGTGTT 600 

TAAGCTGGGC CTCTTCTCGG GCCTCTGGTG GACCCTGGCC CTGTTCTGCT GGATCAGTGA 660 

CCGAGCTTTC TGCGAGCTGC TGTCATCCTT CAACTTCCCC TACCTGCACT GCATGTGGCA 720 

CATCCTCATC TGCCTTGCTG CCTACCTGGG CTGTGTATGC TTTGCCTACT TTGATGCTGC 780 

CTCAGAGATT CCTGAGCAAG GCCCTGTCAT CAAGTTCTGG CCCAATGAGA AATGGGCCTT 840 

CATTGGTGTC CCCTATGTGT CCCTCCTGTG TGCCAACAAG AAATCATCAG TCAAGATCAC 900 

GTGATGGCAA GATGGTGGCT GGCTTCTCTG CTTATCGCCC CTCATGCAGT GGGCTTCCTT 960 

TGCTAGGAAG ACAGCCAAGG GAGTTCGAAT AGTTGGGGTG TGGGCTATCT TTTCAAAAAT 1020 

CTATTTGCTG GGGCTCTTAA TTTCTTTAGT GTTCTTTGTA TGTAGGGATT TAAACTTTGT 1080 

CATATGGTAC AAATATTCCC TGCCCCCCTG CAGTTTCCCA TTTGTCTTTC AGTATGTTAA 1140 

TATTTTTGTG CCATACTGGT TTTAAACTTT CATGTTGTCA CATCTGTTAA TCTTTTCTTT 1200 

AGGATTTCTG GATTTTGTGT AATTTTTAAA AAGGTCCCCT CCTCCTCCCT AATGTGTCTG 1260 

TGGACCACCT GGATTCCACT GTACAAGGGG AAAAGTGTCT ATTCCTTTCC CAAAGATGGA 1320 

AAATGGAGGG CTTAGGGACA CTAGATGCAT CTTTCTCAGC ATCACTTCCA GATGCAGTGA 1380 

CTTGTTGGGC TGCGTCCTTA ATGGCCATGG CAGAGCAGTC CCTTGGGGGA TCCAGCCCTG 1440 

TACAATGCAT CTCTTCCTGG AGAAAGCTGG CCTGCTCCAG ACCCCACCAT TCCCAGGCGC 1500 

CCTTGGAGTG GACTCTACTG ATGACAGACA GACCCTCTGA GAGACAAGAC CCTCTGACTC 1560 

TGTGATGGAA GATGCCAGAG ATTTTCCTTT GGGGTAATTG TCCTTAAACA AAACCAAACA 1620 

GATGAAACAC ACACAGGACT TGTGGCTAAA AAGGCTAGTT TTTCACTTGC ATTTCTCAAC 1680 

TAACCCAGGT TTTACATGCA TCTGTGAATC CTTTTACTAC TACCTCTGTG GAGAGATGGA 1740 

GAGACTTCAG ATAAACGTGA AGCTAATGAG TAAAACCCTC TCTGCCAAAA CCTACACTCC 1800 

ACTTTAGGCC CTTCTTGAAG ATGAGCACAA TTTTTAAATA CTGAGCACAA TTTTTAAATA 1860 

CTGACATCAC TTCCTCTTCC CCCTCCCACC CCAGCTCAGC AGCCTCAAAT CTACAGAGAA 1920 

GAAGAATTAT GGCATGAACA TTCCCACAGA CCCACCATCT TTAAGACTTG ACCTCTGTAA 1980 

GTTTACCAAA GGGCTCCTCA CAATTGTGGT GGGGGTTCTG GTTCAAAATT TGGAGCAAAC 2040 

ATGAAGTTTT TGGAAACGTT TTCTCATTTG AAGCCTCCAG TATGCTGTAC TATTCTGGAA 2100 

ATTACCTTCA AGAGTCTCAC TTCTTGTTTC TGTTGTGTTT TCTGTGGGCA TCATGTTCTT 2160 

CACGCTTGCA GTAGAAGGTG CTTTCTCGGT TTCCCAGAGT ATCCAACGGC TCACCTTTCT 2220 

CAAGTGCTGG CAGTAGCTAT GCACTCACGG GCTGGTTTGG GTCGCTGGTG CAGCAGCGCA 2280 

AATCTGTTGC CTTCTGAATT TTTCTCACCT AATGTGACAC TGGCTACAAT GAATCTTCTC 2340 

TTCATCGGGC TGAATGAAAG ATTCAAGAAC CATCTTCAAG GTGCATGGTG GGAATTATCA 2400 

ACCTCAGGGA TACTCATTTT AACTCAGGCG TGTCCTGCTT TGTAACATTC CATTGTTGGG 2460 

AGAGGGCAGG ACAGGTGTGT TCTTCTGTGG GCAGGAGTCA TGTCACTGTC CTACATATGT 2520 

AAGAGTTGGG AAGGTGACGA TTTTTGACAC ATCCAGGAAC TCTTACTCTA GTTAGAATTT 2580 

GTACCAGATC CAAGGTGAAA ACCCCAATAA GCAACTGAAT TTAGAGTTTA AAAATGAATG 2640 

ACTTTATGCT ACATCTGTGG TTATCAAATT ATATAGGTTG TTGAGAAGCA GAACGCTGTT 2700 

TGTAGTAAGA AATCTTTGTG GAACCCCAGT GTGTGAAGTA AATTGTATGT TATTAAATTT 2760 

ATTTAAGGTT AAATTTATGG CATTTACTTA ATAATATATG AGGTGGTGAA AATGCAAATT 2820

AACAAATTGG TAATTTCCAA GGTAGAAAAA TTAGGTGTTG AATGAATGTA TATGTTGGTT 2880 

TATTATAGTT TTATATATAT ATAGAGAGAG TGTTTTGTTT TTGAGTCAGG GTCTTGCTCT 2940 

ATTACCCAGG CTGGAGTGCA GTGGTGCAGT CATGACTCAC TGTAGCCTCT GTCTCCCAGG 3000 

CTCAAGTGAT CCTCTCACCT CAGCCTCCCC AGGAGCTGGG AGTACTGGTG CGTGCCTCCA 3060 

CTCCCAGCTA ATTTTTGTAT TTTTTCATAG AGATGGGGTT TCACCATTTT GCCCAGGCTG 3120 

GTCTCAAACT TCTGGGCTCA AGAGATTCGC CCGCCTCGGC CTCCCAAAGT GCTGGGAATA 3180 

GGCATGAGCC GTCACGCCTG GCCTAAAAAA TATTTTTAAA ATGATCTTTG AATTAAATAT 3240 

TCGTAGAATT TCTAATGTAT CTCTTTGAGA CCTAGGAGGT TGATGGAACA GAACTGCTGT 3300 

TAAGTCCTTT GGGTTTCAAG TCTAGAATTT TTTAAAGGCA AATATCAGCT CATTCTTATT 3360 

TTAGATTGAC CTTATCAGGC ATGGATTCTG GTCTCATCTA CTTTATGGTA TAAATGCTCC 3420 

AAGGTAGGGG GTTTGGTATA TATTTTAAGC CCGGCTTTTT TTTTTTTTTT TTTTTTTTTT 3480 

TTTTTAATGT GAGAAGCAGA ATGTGCTTCT AGAAACTGGT TTTAAAGAGA TGAGCTGAGA 3540 

AAGAAATGTG GAATGGAGTA TATTTGAGGA GGACAAAACA TAACTTCACT TTTGAACAGA 3600 

AATCACTCTA GCTTGCCAGC ATGGGATGTA AACCAAGAGA GTAGAAATAT ACCCATCTTA 3660 

TTTTAAGTTG GGTTTATGGC ATCGCTCATA TATGTAAAAG CACTACAAAC TCTTTAAAGA 3720 

AAATTGGGAA ACTACAGAGA AGTCAAAGAA AAAAAAAAGT AACCCATATT TCTATTGCCC 3780 

AGGTATAATC CTTGTTAATA TTTTGGTTTG GTCTCCTCTT TTTTTCCCCC AATATAGTTG 3840 

TAAATAAATG ATGTCTTTCA GAGTTGACAT TTATCCTGTA GCTTGAATGG CATGTAAATG 3900 

CCAGTTGTAT ATTTTTTCAT GAAGTGTAGG TTTGGAATAC ACTAGAGTTA GCTATATGCT 3960 

TGAATGCTGA TCACTGGATT CTGAGACTGA CTACTGAGTC TACCTTTTTA ATCAAGCCTA 4020 

ACATGAATGG GCTCCAAAAA GTAATGAATG TAATTGTACT TTTTGATGTG CCTCTGCACT 4080 

TGGCTTGGTG AGTCATCATA AATAGCTGTT AAATATGTGA CTTTACAGAT TTTGATATGT 4140 

TCAGATTGTA AAAAATGAAT AGTTTATTTC ATTAATTGAT GGGCAGTCAA GAATCTCCCT 4200 
CC 






Seq ID NO: 22 Protein sequence:
Protein Accession #: Eos sequence

1 11 21 31 41 51

| | | | | |

MGAPHWWDQL QAGSSEVDWC EDNYTIVPAI AEFYNTISNV LFFILPPICM CLFRQYATCF 60
NSGIYLIWTL LVVVGIGSVY FHATLSFLGQ MLDELAVLWV LMCALAMWFP RRYLPKIFRN 120
DRGRFKVVVS VLSAVTTCLA FVKPAINNIS LMTLGVPCTA LLIAELKRCD NMRVFKLGLF 180
SGLWWTLALF CWISDRAFCE LLSSFNFPYL HCMWHILICL AAYLGCVCFA YFDAASEIPE 240
QGPVIKFWPN EKWAFIGVPY VSLLCANKKS SVKIT
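
Each DNA record in this listing gives a "Coding sequence" range, and the protein record that follows is its translation. The relationship can be checked with a minimal sketch such as the one below. It assumes that the range is 1-based and inclusive and that the standard genetic code applies. The helper names `extract_region` and `translate` are hypothetical and are not part of the listing. The input is the first 120 nucleotides of Seq ID NO: 21, whose coding sequence is stated to start at position 77.

```python
from itertools import product

# Standard genetic code, with codons enumerated in TCAG order
# (TTT, TTC, TTA, TTG, TCT, ...). "*" marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def extract_region(seq, start, end):
    """Return seq[start..end] for 1-based, inclusive coordinates (assumed convention)."""
    return seq[start - 1:end]


def translate(dna):
    """Translate complete codons left to right, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First two rows (positions 1-120) of Seq ID NO: 21, copied from the listing.
rows = [
    "GCCGCGTTTT GCCTCCGCAG CAGCTCTGGG CTCTTCTCAG CTCCGCGAGC AGCTGCTCCA",
    "ATGCCCCGGA GTGGCCATGG GCGCCCCGCA CTGGTGGGAC CAGCTGCAGG CTGGTAGCTC",
]
seq = "".join("".join(rows).split())  # drop the spaces between 10-mer blocks

# Positions 77-118 cover the first 14 complete codons of the coding sequence.
peptide = translate(extract_region(seq, 77, 118))
print(peptide)
```

Under these assumptions the printed peptide should agree with the opening residues of Seq ID NO: 22. The same check applied to a full record is one way to spot residues that were damaged in transcription.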



Seq ID NO: 23 DNA sequence 

Nucleic Acid Accession #: CAT cluster



1 11 21 31 41 51

| | | | | |

TGAAGTATCT CTGTTTAATG TCATATATCT GAGATGTTCT GTGGAGCCAG AATTCAAAAC 60 

CCAGATACGC AGGAGGACAC ATTCTTGATC GGTATTTTAT TCTATTATTT TTATTAGTTC 120 

CAATCCTTAT GACTCTGTTT AAAATGAGTA TCTGCATTAT TGTGCAAACA CTTCTGAGAT 180 

CTCATCACAT ACCTGATCCC ATTTAACTTC AGTCTATCCC TCCATTACTT TTTGTGGAAA 240 

GTGCTATCTA AGTTGGAGGT TCTGCTTGAG TATTCTTAAT TTTTCTTCTT TTTAATGCAT 300 

ACGGTGGTCA AAGTAAATAG CTAACATATT GGCATCAAAC CATTGCCTAT TGAAGCAAGC 360 

TTTGAATGAA ATTGGTTTGT CATTGACCAC TGTAAGAAAC TCATTTGTAA AGATAATACT 420 

GAAAAACATC CTGACTCTTA TGTATGCTTG CCTGTCTCTG AACTCCACCT CTATCAGCAT 480 

ACTCAATACA TTGCCTCTTT CCAACATTTA TCTCCTATTT CAGGAGAAAG TTTTGTGGGA 540 

GCGTCTACCA CGTTGCAAAG ATTCAACATT TCTCTTAAAA ATAGGAGTTT CTTTTAGATA 600 

TAAGCAATGA AATTTCAAGC AACAATATTA CTTCTCTCAT GTCCTTTGCT TTTTTAAGAT 660 

CTGAACCTCA AGATTCAAAT GGATTTTCAT TGAAATTCAC CACGATCCCC TCTGCAGCTG 720 

TGGTCTGAAA GTACCATTTC TGCATTTATT TTAGCCCATG GAATAACTGT GCTGAGAAAC 780 

CACAGAGTCA ATCAGATTCA AAATGTTAAA ATCCTTCCTG CTTGGAGTTT TCCGTCTTCA 840 

CATCAAAGCA TTTCATGCCC GTCAGCAACT TTTTAATGCA TTTGCTCATG GTTTGCAGAA 900 
TTTCCATTTA AAGACTTTCC TTGGCTGACT TCTCTGATGA GGTTTCCTGC TTGCCTC 
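
The records in this listing are laid out as rows of six 10-residue blocks followed by a running position counter. A minimal sketch of reading that layout is shown below. It assumes that a row consists only of residue blocks plus an optional trailing counter and that anything else (rulers, tick marks, headers) can be skipped. The function name `parse_rows` and its `alphabet` parameter are hypothetical. The demonstration rows are the first two rows of Seq ID NO: 23.

```python
import re


def parse_rows(lines, alphabet="ACGTN"):
    """Join block-formatted rows and report rows whose trailing counter
    disagrees with the number of residues read so far."""
    block_re = re.compile(f"[{alphabet}]+")
    sequence = []
    mismatches = []  # (counter printed in the row, residues actually read)
    total = 0
    for line in lines:
        tokens = line.split()
        counter = None
        if tokens and tokens[-1].isdigit():
            counter = int(tokens.pop())
        if not tokens or not all(block_re.fullmatch(t) for t in tokens):
            continue  # blank, ruler, tick-mark or header line
        residues = "".join(tokens)
        total += len(residues)
        sequence.append(residues)
        if counter is not None and counter != total:
            mismatches.append((counter, total))
    return "".join(sequence), mismatches


demo = [
    "1 11 21 31 41 51",
    "TGAAGTATCT CTGTTTAATG TCATATATCT GAGATGTTCT GTGGAGCCAG AATTCAAAAC 60",
    "CCAGATACGC AGGAGGACAC ATTCTTGATC GGTATTTTAT TCTATTATTT TTATTAGTTC 120",
]
sequence, mismatches = parse_rows(demo)
print(len(sequence), mismatches)
```

A non-empty `mismatches` list points at rows where residues were lost or added, which is useful when a listing has passed through character recognition. For protein records a different `alphabet` would have to be supplied.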



Seq ID NO: 24 DNA sequence

Nucleic Acid Accession #: Eos sequence

1 11 21 31 41 51
| | | | | |
GTGTGTAGTC GGGTATGAGA ACGATTGCAA GCAAAAGCAG CTGAGGGACG CTGTCCAAGG 60
AGTAGGCTCA GTAGCACGAA CTGCAAATAC TTGAGAAGGG AAAAGTTCAG CCTTGTGCTG 120
GAAAGTGGTT TAGCATGCTG GAGCTGGTTT TCTGCTTGGT AGCCCTACAA CTTTGGCCCA 180
ACTACTTGGC CTCTGGGACT CAGATTCCTC CTCTTTAAAA TGGTGCTAAT AATAGCACCC 240
ACCCTCTGTG AGGAGGATGC TGTGAGGACA AAATGAGATC ATCCACATAA GCCGTGAACC 300
CTGTTCCTGG TAAGCTCCTG GAAAGAAGTC TATGGATTAT ACTCAACCTA CACTCCAGTT 360
AAAGGAACAT CTACACACAG AGGAAATGAA TAACATGAAG TGAAGTCTTC ATCTCCATTC 420
CCAACAGTCC CCATTCTACT TGCAGAAAGG TTGCTTACAC TGAAAATCAG TTTATTTTCC 480
CCTGGTGCAA AGAACAGTCG TTTCTCCAAA ACTGAAGCTG GAAATTATCT GAAATATCAG 540
GTCCTCCGGA AAAGGGACGT GAAGCCCCCT TTGTAATTTC TGCATTAGCG TGCTCTCCTG 600
GCAAGCAGGA AACCTCATCA GAGAAGTCAG CCAAGGAAAG TCTTTAAATG GAAATTGTGC 660
AAACGAGGAG CAAATGCATT AAAAAGTTGC TGACGGGCAT GAAATGCTTT GATGTGAAGA 720
CGGAAAACTC CAAGCAGGAA GGATTTTAAC ATTTTGAATC TGATTGACTC TGTGGTTTCT 780
CAGCACAGTT ATTCCATGGG CTAAAATAAA TGCAGAAATG GTACTTTCAG ACCACAGCTG 840
CAGAGGGGAT CGTGGTGAAT TTCAATGAAA ATCCATTTGA ATCTTGAGGT TCAGATCTTA 900
AAAAAGCAAA GGACATGAGA GAAGTAATAT TGTTGCTTGA AATTTCATTG CTTATATCTA 960
AAAGAAACTC CTATTTTTAA GAGAAATGTT GAATCTTTGC AACGTGGTAG ACGCTCCCAC 1020
AAAACTTTCT CCTGAAATAG GAGATAAATG TTGGAAAGAG GCAATGTATT GAGTATGCTG 1080
ATAGAGGTGG AGTTCAGAGA CAGGCAAGCA TACATAAGAG TCAGGATGTT TTTCAGTATT 1140
ATCTTTACAA ATGAGTTTCT TACAGTGGTC AATGACAAAC CAATTTCATT CAAAGCTTGC 1200
TTCAATAGGC AATGGTTTGA TGCCAATATG TTAGCTATTT ACTTTGACCA CCGTATGCAT 1260
TAAAAAGAAG AAAAATTAAG AATACTCAAG CAGAACCTCC AACTTAGATA GCACTTTCCA 1320
CAAAAAGTAA TGGAGGGATA GACTGAAGTT AAATGGGATC AGGTATGTGA TGAGATCTCA 1380
GAAGTGTTTG CACAATAATG CAGATACTCA TTTTAAACAG AGTCATAAGG ATTGGAACTA 1440
ATAAAAATAA TAGAATAAAA TACCGATCAA GAATGTGTCC TCCTGCGTAT CTGGGTTTTG 1500
AATTCTGGCT CCACAGAACT TGTCAGATAT ATGACATTAA AC



Seq ID NO: 25 DNA sequence 

Nucleic Acid Accession #: Eos sequence



1 11 21 31 41 51

GTGTGTAGTC GGGTATGAGA ACGATTGCAA GCAAAAGCAG CTGAGGGACG CTGTCCAAGG 60 

AGTAGGCTCA GTAGCACGAA CTGCAAATAC TTGAGAAGGG AAAAGTTCAG CCTTGTGCTG 120 

GAAAGTGGTT TAGCATGCTG GAGCTGGTTT TCTGCTTGGT AGCCCTACAA CTTTGGCCCA 180 

ACTACTTGGC CTCTGGGACT CAGATTCCTC CTCTTTAAAA TGGTGCTAAT AATAGCACCC 240 

ACCCTCTGTG AGGAGGATGC TGTGAGGACA AAATGAGATC ATCCACATAA GCCGTGAACC 300 

CTGTTCCTGG TAAGGAAATG AATAACATGA AGTGAAGTCT TCATCTCCAT TCCCAACAGT 360 

CCCCATTCTA CTTGCAGAAA GGTTGCTTAC ACTGAAAATC AGTTTATTTT CCCCTGGTGC 420 

AAAGAACAGT CGTTTCTCCA AAACTGAAGC TGGAAATTAT CTGAAATATC AGGTCCTCCG 480 




GAAAAGGGAC GTGAAGCCCC CTTTGTAATT TCTGCATTAG CGTGCTCTCC TGGCAAGCAG 540 
GAAACCTCAT CAGAGAAGTC AGCCAAGGAA AGTCTTTAAA TGGAAATTGT GCAAACGAGG 600 
AGCAAATGCA TTAAAAAGTT GCTGACGGGC ATGAAATGCT TTGATGTGAA GACGGAAAAC 660 
TCCAAGCAGG AAGGATTTTA ACATTTTGAA TCTGATTGAC TCTGTGGTTT CTCAGCACAG 720 
TTATTCCATG GGCTAAAATA AATGCAGAAA TGGTACTTTC AGACCACAGC TGCAGAGGGG 780 

ATCGTGGTGA ATTTCAATGA AAATCCATTT GAATCTTGAG GTTCAGATCT TAAAAAAGCA 840 
AAGGACATGA GAGAAGTAAT ATTGTTGCTT GAAATTTCAT TGCTTATATC TAAAAGAAAC 900 

TCCTATTTTT AAGAGAAATG TTGAATCTTT GCAACGTGGT AGACGCTCCC ACAAAACTTT 960 

CTCCTGAAAT AGGAGATAAA TGTTGGAAAG AGGCAATGTA TTGAGTATGC TGATAGAGGT 1020 

GGAGTTCAGA GACAGGCAAG CATACATAAG AGTCAGGATG TTTTTCAGTA TTATCTTTAC 1080

AAATGAGTTT CTTACAGTGG TCAATGACAA ACCAATTTCA TTCAAAGCTT GCTTCAATAG 1140 

GCAATGGTTT GATGCCAATA TGTTAGCTAT TTACTTTGAC CACCGTATGC ATTAAAAAGA 1200 

AGAAAAATTA AGAATACTCA AGCAGAACCT CCAACTTAGA TAGCACTTTC CACAAAAAGT 1260 

AATGGAGGGA TAGACTGAAG TTAAATGGGA TCAGGTATGT GATGAGATCT CAGAAGTGTT 1320 

TGCACAATAA TGCAGATACT CATTTTAAAC AGAGTCATAA GGATTGGAAC TAATAAAAAT 1380

AATAGAATAA AATACCGATC AAGAATGTGT CCTCCTGCGT ATCTGGGTTT TGAATTCTGG 1440 
CTCCACAGAA CTTGTCAGAT ATATGACATT AAAC 

Seq ID NO: 26 DNA sequence
Nucleic Acid Accession #: NM_013282.2
Coding sequence: 85..2466

1 11 21 31 41 51

| | | | | |

CGACTCCTTA GAGCATGGCA TGGCTCAGAG GTGCTGGTAA AACTGATGGG GGTTTTTGCT 60

GTCCCTCCCC TCAGCGCCGA CACCATGTGG ATCCAGGTTC GGACCATGGA CGGGAGGCAG 120 

ACCCACACGG TGGACTCGCT GTCCAGGCTG ACCAAGGTGG AGGAGCTGAG GCGGAAGATC 180 

CAGGAGCTGT TCCACGTGGA GCCAGGCCTG CAGAGGCTGT TCTACAGGGG CAAACAGATG 240 

GAGGACGGCC ATACCCTCTT CGACTACGAG GTCCGCCTGA ATGACACCAT CCAGCTCCTG 300 

GTCCGCCAGA GCCTCGTGCT CCCCCACAGC ACCAAGGAGC GGGACTCCGA GCTCTCCGAC 360 

ACCGACTCCG GCTGCTGCCT GGGCCAGAGT GAGTCAGACA AGTCCTCCAC CCACGGCGAG 420 

GCGGCCGCCG AGACTGACAG CAGGCCAGCC GATGAGGACA TGTGGGATGA GACGGAATTG 480 

GGGCTGTACA AGGTCAATGA GTACGTCGAT GCTCGGGACA CGAACATGGG GGCGTGGTTT 540 

GAGGCGCAGG TGGTCAGGGT GACGCGGAAG GCCCCCTCCC GGGACGAGCC CTGCAGCTCC 600 

ACGTCCAGGC CGGCGCTGGA GGAGGACGTC ATTTACCACG TGAAATACGA CGACTACCCG 660

GAGAACGGCG TGGTCCAGAT GAACTCCAGG GACGTCCGAG CGCGCGCCCG CACCATCATC 720 

AAGTGGCAGG ACCTGGAGGT GGGCCAGGTG GTCATGCTCA ACTACAACCC CGACAACCCC 780 

AAGGAGCGGG GCTTCTGGTA CGACGCGGAG ATCTCCAGGA AGCGCGAGAC CAGGACGGCG 840 

CGGGAACTCT ACGCCAACGT GGTGCTGGGG GATGATTCTC TGAACGACTG TCGGATCATC 900

TTCGTGGACG AAGTCTTCAA GATTGAGCGG CCGGGTGAAG GGAGCCCCAT GGTTGACAAC 960

CCCATGAGAC GGAAGAGCGG GCCGTCCTGC AAGCACTGCA AGGACGACGT GAACAGACTC 1020 

TGCCGGGTCT GCGCCTGCCA CCTGTGCGGG GGCCGGCAGG ACCCCGACAA GCAGCTCATG 1080 

TGCGATGAGT GCGACATGGC CTTCCACATC TACTGCCTGG ACCCGCCCCT CAGCAGTGTT 1140 

CCCAGCGAGG ACGAGTGGTA CTGCCCTGAG TGCCGGAATG ATGCCAGCGA GGTGGTACTG 1200

GCGGGAGAGC GGCTGAGAGA GAGCAAGAAG AAGGCGAAGA TGGCCTCGGC CACATCGTCC 1260

TCACAGCGGG ACTGGGGCAA GGGCATGGCC TGTGTGGGCC GCACCAAGGA ATGTACCATC 1320

GTCCCGTCCA ACCACTACGG ACCCATCCCG GGGATCCCCG TGGGCACCAT GTGGCGGTTC 1380 

CGAGTCCAGG TCAGCGAGTC GGGTGTCCAT CGGCCCCACG TGGCTGGCAT ACACGGCCGG 1440 

AGCAACGACG GAGCGTACTC CCTAGTCCTG GCGGGGGGCT ATGAGGATGA CGTGGACCAT 1500 

GGGAATTTTT TCACATACAC GGGTAGTGGT GGTCGAGATC TTTCCGGCAA CAAGAGGACC 1560

GCGGAACAGT CTTGTGATCA GAAACTCACC AACACCAACA GGGCGCTGGC TCTCAACTGC 1620 

TTTGCTCCCA TCAATGACCA AGAAGGGGCC GAGGCCAAGG ACTGGCGGTC GGGGAAGCCG 1680 

GTCAGGGTGG TGCGCAATGT CAAGGGTGGC AAGAATAGCA AGTACGCCCC CGCTGAGGGC 1740

AACCGCTACG ATGGCATCTA CAAGGTTGTG AAATACTGGC CCGAGAAGGG GAAGTCCGGG 1800 

TTTCTCGTGT GGCGCTACCT TCTGCGGAGG GACGATGATG AGCCTGGCCC TTGGACGAAG 1860

GAGGGGAAGG ACCGGATCAA GAAGCTGGGG CTGACCATGC AGTATCCAGA AGGCTACCTG 1920 

GAAGCCCTGG CCAACCGAGA GCGAGAGAAG GAGAACAGCA AGAGGGAGGA GGAGGAGCAG 1980 

CAGGAGGGGG GCTTCGCGTC CCCCAGGACG GGCAAGGGCA AGTGGAAGCG GAAGTCGGCA 2040 

GGAGGTGGCC CGAGCAGGGC CGGGTCCCCG CGCCGGACAT CCAAGAAAAC CAAGGTGGAG 2100 

CCCTACAGTC TCACGGGCCA GCAGAGCAGC CTCATCAGAG AGGACAAGAG CAACGCCAAG 2160

CTGTGGAATG AGGTCCTGGC GTCACTCAAG GACCGGCCGG CGAGCGGCAG CCCGTTCCAG 2220 

TTGTTCCTGA GTAAAGTGGA GGAGACGTTC CAGTGTATCT GCTGTCAGGA GCTGGTGTTC 2280 

CGGCCCATCA CGACCGTGTG CCAGCACAAC GTGTGCAAGG ACTGCCTGGA CAGATCCTTT 2340 

CGGGCACAGG TGTTCAGCTG CCCTGCCTGC CGCTACGACC TGGGCCGCAG CTATGCCATG 2400

CAGGTGAACC AGCCTCTGCA GACCGTCCTC AACCAGCTCT TCCCCGGCTA CGGCAATGGC 2460

CGGTGATCTC CAAGCACTTC TCGACAGGCG TTTTGCTGAA AACGTGTCGG AGGGCTCGTT 2520 

CATCGGCACT GATTTTGTTC TTAGTGGGCT TAACTTAAAC AGGTAGTGTT TCCTCCGTTC 2580 

CCTAAAAAGG TTTGTCTTCC TTTTTTTTTA TTTTTATTTT TCAAATCTAT ACATTTTCAG 2640 

GAATTTATGT ATTCTGGCTA AAAGTTGGAC TTCTCAGTAT TGTGTTTAGT TCTTTGAAAA 2700 

CATAAAAGCC TGCAATTTCT CGACAAAACA ACACAAGATT TTTTAAAGAT GGAATCAGAA 2760

ACTACGTGGT GTGGAGGCTG TTGATGTTTC TGGTGTCAAG TTCTCAGAAG TTGCTGCCAC 2820 

CAACTCTTTA AGAAGGCGAC AGGATCAGTC CTTCTCTAGG GTTCTGGCCC CCAAGGTCAG 2880 

AGCAAGCATC TTCCTGACAG CATTTTGTCA TCTAAAGTCC AGTGACATGG TTCCCCGTGG 2940 

TGGCCCGTGG CAGCCCGTGG CATGGCGTGG CTCAGCTGTC TGTTGAAGTT GTTGCAAGGA 3000

AAAGAGGAAA CATCTCGGGC CTAGTTCAAA CCTTTGCCTC AAAGCCATCC CCCACCAGAC 3060

TGCTTAGCGT CTGAGATCCG CGTGAAAAGT CCTCTGCCCA CGAGAGCAGG GAGTTGGGGC 3120 

CACGCAGAAA TGGCCTCAAG GGGACTCTGC TCCACGTGGG GCCAGGCCTG TGACTGACGC 3180 

TGTCCGACGA AGGCGGCCAC GGACGGACGC CAGCACACGA AGTCACGTGC AAGTGCCTTT 3240 
GATTCGTTCC TTCTTTCTAA AGACGACAGT CTTTGTTGTT AGCACTGAAT TATTGAAAAT 3300

GTCAACCAGA TTCTAGAAAC TGCGGTCATC CAGTTCTTCC TGACACCGGA TGGGTGCTTG 3360 

GGAACCGTTT GAGCCTTATA GATCATTTAC ATTCAATTTT TTTAACTCAG CAAGTGAGAA 3420 

CTTACAAGAG GGTTTTTTTT TAATTTTTTT TTCTCTTAAT GAACACATTT TCTAAATGAA 3480 

TTTTTTTTGT AGTTACTGTA TATGTACCAA GAAAGATATA ACGTTAGGGT TTGGTTGTTT 3540 

TTGTTTTTGT ATTTTTTTTC TTTTGAAAGG GTTTGTTAAT TTTTCTAATT TTACCAAAGT 3600 

TTGCAGCCTA TACCTCAATA AAACAGGGAT ATTTTAAATC ACATACCTGC AGACAAACTG 3660 

GAGCAATGTT ATTTTTAAAG GGTTTTTTTC ACCTCCTTAT TCTTAGATTA TTAATGTATT 3720 

AGGGAAGAAT GAGACAATTT TGTGTAGGCT TTTTCTAAAG TCCAGTACTT TGTCCAGATT 3780 

TTAGATTCTC AGAATAAATG TTTTTCACAG ATTGAAAAAA AAAAAAAA 3828 

Seq ID NO: 27 Protein Sequence 
Protein Accession #: NP_037414.2 

1 11 21 31 41 51

MWIQVRTMDG RQTHTVDSLS RLTKVEELRR KIQELFHVEP GLQRLFYRGK QMEDGHTLFD 60 

YEVRLNDTIQ LLVRQSLVLP HSTKERDSEL SDTDSGCCLG QSESDKSSTH GEAAAETDSR 120 

PADEDMWDET ELGLYKVNEY VDARDTNMGA WFEAQVVRVT RKAPSRDEPC SSTSRPALEE 180

DVIYHVKYDD YPENGVVQMN SRDVRARART IIKWQDLEVG QVVMLNYNPD NPKERGFWYD 240

AEISRKRETR TARELYANVV LGDDSLNDCR IIFVDEVFKI ERPGEGSPMV DNPMRRKSGP 300

SCKHCKDDVN RLCRVCACHL CGGRQDPDKQ LMCDECDMAF HIYCLDPPLS SVPSEDEWYC 360 

PECRNDASEV VLAGERLRES KKKAKMASAT SSSQRDWGKG MACVGRTKEC TIVPSNHYGP 420 

IPGIPVGTMW RFRVQVSESG VHRPHVAGIH GRSNDGAYSL VLAGGYEDDV DHGNFFTYTG 480 

SGGRDLSGNK RTAEQSCDQK LTNTNRALAL NCFAPINDQE GAEAKDWRSG KPVRVVRNVK 540

GGKNSKYAPA EGNRYDGIYK VVKYWPEKGK SGFLVWRYLL RRDDDEPGPW TKEGKDRIKK 600

LGLTMQYPEG YLEALANRER EKENSKREEE EQQEGGFASP RTGKGKWKRK SAGGGPSRAG 660 

SPRRTSKKTK VEPYSLTAQQ SSLIREDKSN AKLWNEVLAS LKDRPASGSP FQLFLSKVEE 720 

TFQCICCQEL VFRPITTVCQ HNVCKDCLDR SFRAQVFSCP ACRYDLGRSY AMQVNQPLQT 780 
VLNQLFPGYG NGR 



Seq ID NO: 28 DNA sequence

Nucleic Acid Accession #: NM_000756.1

Coding sequence: 186..776



1 11 21 31 41 51 

AGAAACTCAG AGACCAAGTC CATTGAGAGA CTGAGGGGAA AGAGAGGAGA GAAAGAAAAA 60 

GAGAGTGGGA ACAGTAAAGA GAAAGGAAGA CAACCTCCAG AGAAAGCCCC CGGAGACGTC 120 

TCTCTGCAGA GAGGCGGCAG CACCCGGCTC ACCTGCGAAG CGCCTGGGAA GCGAGTGCCC 180 

CTAACATGCG GCTGCCGCTG CTTGTGTCCG CGGGAGTCCT GCTGGTGGCT CTCCTGCCCT 240 

GCCCGCCATG CAGGGCGCTC CTGAGCCGCG GGCCGGTCCC GGGAGCTCGG CAGGCGCCGC 300

AGCACCCTCA GCCCTTGGAT TTCTTCCAGC CGCCGCCGCA GTCCGAGCAG CCCCAGCAGC 360

CGCAGGCTCG GCCGGTCCTG CTCCGCATGG GAGAGGAGTA CTTCCTCCGC CTGGGGAACC 420 

TCAACAAGAG CCCGGCCGCT CCCCTTTCGC CCGCCTCCTC GCTCCTCGCC GGAGGCAGCG 480 

GCAGCCGCCC TTCGCCGGAA CAGGCGACCG CCAACTTTTT CCGCGTGTTG ™CAGCAGC 540 

TGCTGCTGCC TCGGCGCTCG CTCGACAGCC CCGCGGCTCT CGCGGAGCGC GGCGCTAGGA 600 

ATGCCCTCGG CGGCCACCAG GAGGCACCGG AGAGAGAAAG GCGGTCCGAG GAGCCTCCCA 660 

TCTCCCTGGA TCTCACCTTC CACCTCCTCC GGGAAGTCTT GGAAATGGCC AGGGCCGAGC 720 

AGTTAGCACA GCAAGCTCAC AGCAACAGGA AACTCATGGA GATTATTGGG AAATAAAACG 780 

GTGCGTTTGG CCAAAAAGAA TCTGCATTTA GCACAAAAAA AATTTAAAAA AATACAGTAT 840 

TCTGTACCAT AGCGCTGCTC TTATGCCATT TGTTTATTTT TATATAGCTT GAAACATAGA 900 

GGGAGAGAGG GAGAGAGCCT ATACCCCTTA CTrAGCATGC ACAAAGTGTA TT^CGTGCA 960 

GCAGCAACAC AATGTTATTC GTTTTGTCTA CGTTTAGTTT CCGTTTCCAG GTGTTTATAG 1020 

TGGTGTTTTA AAGAGAATGT AGACCTGTGA GAAAACGTTT TGTTTGAAAA AGCAGACAGA 1080 

AGTCACTCAA TTGTTTTTGT TGTGGTCTGA GCCAAAGAGA ATGCCATTCT CTTGGGTGGG 1140 

TAAGACTAAA TCTGTAAGCT CTTTGAAACA ACTTTCTCTT GTAAACGTTT CAGTAATAAA 1200 

ACATCTTTCC AGTCCTTGGT CAGTTTGGTT GTGTAAGAGA ATGTTGAATA CTTATATTTT 1260 
TAATAAAAGT TGCAAAGGT 



Seq ID NO: 29 Protein Sequence
Protein Accession #: NP_000747.1

1 11 21 31 41 51

MRLPLLVSAG VLLVALLPCP PCRALLSRGP VPGARQAPQH PQPLDFFQPP PQSEQPQQPQ 60
ARPVLLRMGE EYFLRLGNLN KSPAAPLSPA SSLLAGGSGS RPSPEQATAN FFRVLLQQLL 120
LPRRSLDSPA ALAERGARNA LGGHQEAPER ERRSEEPPIS LDLTFHLLRE VLEMARAEQL 180
AQQAHSNRKL MEIIGK

Seq ID NO: 30 DNA sequence 

Nucleic Acid Accession #: CAT cluster 

1 11 21 31 41 51

ATTTCTTGAG ATGGCTCCTC TTGGAACTAT GCATCTCTCC TGATACTTGG ATGCTTTTCC 60 
TCTGACTGAT GAAGATCCTG AATACCAAAG AGGGCCGCTG AAAGGGCTCT AGGAGTACAC 120 
CTTCTAGGAA CCCTAAGCCA GAGAGAGGCT TCACTACATC ATGCTTCCTG ACATCTCTCC 180

CTTTGAAGAG CAGTCAGACT CCTGCTTTGC TCTTCAGACT TAATTTGGGG GTTTAACAGG 240 

TGAGGTTGCT GGGGGAACTC TTTTACAACA TCTCTCTGAA AGAATCCGGG CTGCCAGTTT 300 

CATTTGGTTT GGGTGTCAGT AGCATGATGG AAAGACAAAA AAACACAACT TGACATCTGC 360

AGAAATGGGT TCAAATTTTA CCTGCAACTC ACCAATTCTG TGGCCTTGGT TCAGCAATTA 420 

AACTCCCTAA AATTCAGTTT TTTCTTTGTA AAATGGGGTT ATGAACAGTA CCTACTTCAA 480 

AATGTGTTTG TGAAGATTAA AAAAGTTAAC ATAAAGAGTT TAGAAGAGTG TCTGGCATAT 540 
TGTGCTCAAT AAGTGTTTAT TTATTTATTG CTGAATAAAC CAGTAATTTA ATTAGTAT 

Seq ID NO: 31 DNA sequence

Nucleic Acid Accession #: Eos sequence
Coding sequence: 134..1917

1 11 21 31 41 51

CACCTCAGCC CCAACCCCAA CCCCAGCCGC ATCCCCTGCC CCAGCTGACG GGTCAAAGCC 60 

TCAGGAGAGT GTGGCTCTCC CCAGGCGCTA CCAGGAGGGG CAGGTCTCAG CCAGCTGGGG 120 

AAACCTTATT GCCATGGTTC TTAGAAGCCA CCCCTTCCCC AGGCAAGACA GGCCCCAAGG 180 

GAGTGTCCCG AGGGCGGTTC CCGGGAGCCC CGTGGGTCCC AGCACTTCCA CACACTCTGA 240 

GGACAGACAC GGCCCCTCTT CTTCAGTGGG GACAGTCATA GGGACAGGTA CAGGGGGCCT 300

GGTTGAGGCT GGAGGTCAGC CACAGCCAAG AAGCTCCGAG ACCAACGGAT CGCCCAGCCC 360 

AGACCCTCCC CCAGGCCTAA GAGGAGAGGG AACCAGGGAG AAAAGTCTAG ACCCGCTGCC 420 

CCAAGCCGCG ATGCCCAGGG GCCCCGCACA GCCCCCCGCG CAGAGGCCGC CTGGCCCCGC 480 

GGCCTCCTCC TCTGCGAGGC GCTCACAGCC GGTACCCCAG CTACGGAAAC GCAGCAGGTG 540 

CGAAATCGCC CCGAGCTCGG AGCAGGAGGT CAGGCCGGCC GCCTCGGGGG ACCCTCAAGG 600

GGAGGCGCCG GGGGAGGGGG GCAGCCCTGC CGGCCGCAGC GGGGCGCTCA CGGAAAAGCA 660 

GGAGGAGGCC CGGAAGCTCA TGGTGTTTCT GCAGAGGCCC GGGGGTTGGG GGGTGGTGGA 720 

GGGGCCCCGG AAGCCCAGCT CCCGGGCCCT GGAGCCCGCC ACGGCGGCAG CCCTGCGGCG 780 

GCGGCTGGAC CTGGGCAGTT GCCTGGACGT GCTGGCCTTT GCCCAGCAGC ACGGAGAGCC 840 

CGGCCTGGCG CAGGAGACCT ACGCGCTGAT GAGCGACAAC CTGCTGCGAG TGCTGGGAGA 900

CCCGTGCCTC TACCGCCGGC TGAGCGCGGC CGACCGCGAG CGCATCCTCA GCCTGCGGAC 960 

CGGCCGGGGC CGGGCGGTGC TGGGCGTCCT CGTACTGCCC AGCCTCTACC AGGGGGGCCG 1020 

CTCAGGGCTC CCCAGGGGCC CTCGTGGCGA GGAGCCTCCT GCGGCGGCCC CTGTGTCCCT 1080 

GCCTCTACCT GCGCACCTGC ATGTGTTCAA CCCCCGGGAG AACACCTGGC GGCCCCTGAC 1140 

CCAGGTGCCC GAGGAGGCCC CGCTTCGGGG CTGCGGTCTC TGCACCATGC ACAACTACCT 1200

GTTTCTGGCG GGGGGCATCC GTGGCTCCGG TGCCAAGGCC GTCTGCTCCA ACGAGGTCTT 1260 

CTGCTACAAC CCTCTGACCA ACATCTGGAG CCAGGTTCGG CCCATGCAGC AGGCCCGAGC 1320 

CCAGCTCAAG CTGGTGGCCC TGGACGGGCT GCTCTATGCC ATCGGTGGCG AATGCCTGTA 1380 

CAGCATGGAG TGCTACGACC CGCGAACAGA CGCCTGGACC CCACGCGCGC CACTCCCCGC 1440 

AGGCACCTTC CCTGTGGCCC ACGAGGCTGT GGCCTGCCGT GGGGACATCT ACGTCACCGG 1500

GGGTCACCTC TTCTACCGCC TGCTCAGGTA CAGCCCCGTG AAGGATGCTT GGGACGAGTG 1560 

CCCATACAGT GCCAGCCACC GGCGTTCCAG CGACATCGTG GCACTGGGGG GCTTCCTGTA 1620 

CCGCTTCGAC CTGCTGCGGG GCGTGGGCGC CGCCGTGATG CGCTACAACA CAGTGACCGG 1680 

CTCCTGGAGC AGGGCTGCCT CCCTGCCCCT GCCCGCCCCC GCCCCACTGC ACTGCACCAC 1740 

CCTGGGCAAC ACCATTTACT GCCTCAACCC CCAGGTCACT GCCACCTTCA CGGTCTCTGG 1800

GGGGACTGCC CAGTTCCAGG CCAAGGAGCT GCAGCCCTTC CCCTTGGGGA GCACCGGGGT 1860 

CCTCAGTCCA TTCATCCTGA CTCTGCCCCC TGAGGACCGG CTGCAGACCT CACTCTGAGT 1920 

GGCAGGCAGA GAACCAAAGC TGCTTCGCTG CTCTCCAGGG AGACCCTCCT GGGATGGGCC 1980 

TGAGAGGCCG GGGCTCAGGG AAGGGGCTGG GATCGGAACT TCCTGCTCTT GTTTCTGGAC 2040 

AACTTTCCCC TTCTGCTTTA AAGGTTGTCG ATTATTTTGA AGCCCAGACT CCCTCAGCCT 2100

CTTTCTGCCC CTCACTCCAC ACCCAGACTG TTTCCTGACT CAATTCCGTA CCTACTTACA 2160 

GACCCTCTCA GCTTGCTGAC ACCCCCCTGT CTGTGGGACT CCCTATTCCC TAGAGCCAGG 2220 

GACTGATGCG TCTCCACAGA CAAGGACTTG GCTCGCTGGA GCTCTGCTGA GCCGAGAGAG 2280 

GAGGGGGTAG AAAACATTCA CACTTCCTAT GCTCTGTCAG CAGGACAGGG AGCAAAAACG 2340 

TCCCCAGGCA ACGCCCTCGC CTCTGGGACT TTCTGCCTGT CCTAAGGCCT CCCCAGGTAC 2400

CAACCCCGTA GCTATCTGGG TCTGTTTGGC ACTGTGGATT CTCAAGGGCC TAGAACCCTT 2460 

GCCTCTGAAA CTGGTCCGCT GGTGCAGCCC TGCTGTCTGC AGCTCCTGCC CATACCCCCA 2520 

GCCCACACCA GGCCAGGCCC ACTCCGGGCT CACCACCCTC TGCAGCCTTG TGGGGCTCTC 2580 

CCAGCCCCTC CAGAAGCCCA CCCCACTTCT CGCCAACCCC CGATCTCTAA ATGAGGCCTG 2640 

AGCGTCACCC TAGTTCTGCC CCTTTTTAGC TGTGTAGACT TGGACGAGAC ATTTGACTTC 2700

CCTTTCTCCT TGTCTATAAA ATGTGGACAG TGGACGTCTG TCACCCAAGA GAGTTGTGGG 2760 

AGACAAGATC ACAGCTATGA GCACCTCGCA CGGTGTCCAG GATGCACAGC ACAATCCATG 2820 

ATGCGTTTTC TCCCCTTACG CACTTTGAAA CCCATGCTAG AAAAGTGAAT ACATCTGACT 2880 

GTGCTCCACT CCAACCTCCA GCCTGGATGT CCCTGTCTGG GCCCTTTTTC TGTTTTTTAT 2940 

TCTATGTTCA GCACCACTGG CACCAAATAC ATTTTAATTC ACCGAAAGCA

Seq ID NO: 32 Protein Sequence
Protein Accession #: XP_054631.1

1 11 21 31 41 51

| | | | | |

MPRGPAQPPA QRPPGPAASS SARRSQPVPQ LRKRSRCEIA PSSEQEVRPA ASGDPQGEAP 60

GEGGSPAGRS GALTEKQEEA RKLMVFLQRP GGWGVVEGPR KPSSRALEPA TAAALRRRLD 120

LGSCLDVLAF AQQHGEPGLA QETYALMSDN LLRVLGDPCL YRRLSAADRE RILSLRTGRG 180

RAVLGVLVLP SLYQGGRSGL PRGPRGEEPP AAAPVSLPLP AHLHVFNPRE NTWRPLTQVP 240

EEAPLRGCGL CTMHNYLFLA GGIRGSGAKA VCSNEVFCYN PLTNIWSQVR PMQQARAQLK 300

LVALDGLLYA IGGECLYSME CYDPRTDAWT PRAPLPAGTF PVAHEAVACR GDIYVTGGHL 360

FYRLLRYSPV KDAWDECPYS ASHRRSSDIV ALGGFLYRFD LLRGVGAAVM RYNTVTGSWS 420

RAASLPLPAP APLHCTTLGN TIYCLNPQVT ATFTVSGGTA QFQAKELQPF PLGSTGVLSP 480
FILTLPPEDR LQTSL



Seq ID NO: 33 DNA sequence 

Nucleic Acid Accession #: CAT cluster



1 11 21 31 41 51 

CCTCTGTCTA ACCTTACATG AAAAAACCCG TTTCCAACGA AGGCCTCTAA AAGGCCAAGA 60 

AATCCCCTTG CAGACTTTAC AAACAGAGTG TTTCCTAACT GCTCTATGAA CAGAAAGGTT 120 

AAACTCTGTG AGTTGAACCA ACACATCACA ACGCAGTTTG TGGGAATGAT TCTGTCTAGT 180 

TTTGAAACGA AGATATTTCC TTTTCTGCCG TTGACCTTAA AGCGCTTGAA ATCTACACTT 240 

GCAAATTGCA CAAATAGAGT GTTTCAAATC TGCTCTGTCT AAGGGAACGT TCAACTCTGT 300 

GAGTTGAATG CACACAACAC AAGGAAGTTA CTGGGAATTC TTCTGTCTAG CCTTACATGA 360 

AAGAAAACCC GTTTCCAACG AAGGCCTCTA AGTGGTCAAA ATATCCACGT GCAGACTTTA 420 

CAAACAGAGT GTTTCCAAAC TGCTGAATGA AAAGAAAAGT AAACTCTGAG AGTTGAACGC 480 

ACACATCGCA GAGCAGTTTC TGAGAATGAT TCTGTCTAGT TTTTATACGA AGATATTTCC 540 

TTTTCTGCCT TTGGCCTCAA AGCGCTTGAA ATCTCCACTT GCAAATTCCA CAAAAAGAGT 600 

GTTTCAAATC TGCTCTGTCT AAAGGAAGGT TCAACTCTGT GAGTTGAATA CACACAACAC 660 

AAAGAAGTTA CTGAGAATTC TTCTGTCTAG CATTATATGA AGAAATCCCG TTTCCAACGA 720 

AGGCATCTAA GAGGTCCAAA TATCCACTTG CAGACTTTAC AAACAGAGGG TTTCCAGAAT 780 
GGCTGTATGA AAAGAAAGGT TAAACTCTGT GAGTTAAACA CACA 



Seq ID NO: 34 DNA sequence 

Nucleic Acid Accession #: AF011468.1

Coding sequence: 257..1468

1 11 21 31 41 51

| | | | | |



GGAAGACTTG GGTCCTTGGG TCGCAGGTGG GAGCCGACGG GTGGGTAGAC CGTGGGGGAT 60 

ATCTCAGTGG CGGACGAGGA CGGCGGGGAC AAGGGGCGGC TGGTCGGAGT GGCGGAGCGT 120 

CAAGTCCCCT GTCGGTTCCT CCGTCCCTGA GTGTCCTTGG CGCTGCCTTG TGCCCGCCCA 180 

GCGCCTTTGC ATCCGCTCCT GGGCACCGAG GCGCCCTGTA GGATACTGCT TGTTACTTAT 240 

TACAGCTAGA GGCATCATGG ACCGATCTAA AGAAAACTGC ATTTCAGGAC CTGTTAAGGC 300 

TACAGCTCCA GTTGGAGGTC CAAAACGTGT TCTCGTGACT CAGCAAATTC CTTGTCAGAA 360 

TCCATTACCT GTAAATAGTG GCCAGGCTCA GCGGGTCTTG TGTCCTTCAA ATTCTTCCCA 420 

GCGCGTTCCT TTGCAAGCAC AAAAGCTTGT CTCCAGTCAC AAGCCGGTTC AGAATCAGAA 480 

GCAGAAGCAA TTGCAGGCAA CCAGTGTACC TCATCCTGTC TCCAGGCCAC TGAATAACAC 540 

CCAAAAGAGC AAGCAGCCCC TGCCATCGGC ACCTGAAAAT AATCCTGAGG AGGAACTGGC 600 

ATCAAAACAG AAAAATGAAG AATCAAAAAA GAGGCAGTGG GCTTTGGAAG ACTTTGAAAT 660 

TGGTCGCCCT CTGGGTAAAG GAAAGTTTGG TAATGTTTAT TTGGCAAGAG AAAAGCAAAG 720 

CAAGTTTATT CTGGCTCTTA AAGTGTTATT TAAAGCTCAG CTGGAGAAAG CCGGAGTGGA 780

GCATCAGCTC AGAAGAGAAG TAGAAATACA GTCCCACCTT CGGCATCCTA ATATTCTTAG 840 

ACTGTATGGT TATTTCCATG ATGCTACCAG AGTCTACCTA ATTCTGGAAT ATGCACCACT 900 

TGGAACAGTT TATAGAGAAC TTCAGAAACT TTCAAAGTTT GATGAGCAGA GAACTGCTAC 960 

TTATATAACA GAATTGGCAA ATGCCCTGTC TTACTGTCAT TCGAAGAGAG TTATTCATAG 1020

AGACATTAAG CCAGAGAACT TACTTCTTGG ATCAGCTGGA GAGCTTAAAA TTGCAGATTT 1080 

TGGGTGGTCA GTACATGCTC CATCTTCCAG GAGGACCACT CTCTGTGGCA CCCTGGACTA 1140 

CCTGCCCCCT GAAATGATTG AAGGTCGGAT GCATGATGAG AAGGTGGATC TCTGGAGCCT 1200 

TGGAGTTCTT TGCTATGAAT TTTTAGTTGG GAAGCCTCCT TTTGAGGCAA ACACATACCA 1260 

AGAGACCTAC AAAAGAATAT CACGGGTTGA ATTCACATTC CCTGACTTTG TAACAGAGGG 1320 

AGCCAGGGAC CTCATTTCAA GACTGTTGAA GCATAATCCC AGCCAGAGGC CAATGCTCAG 1380 
AGAAGTACTT GAACACCCCT GGATCACAGC AAATTCATCA AAACCATCAA ATTGCCAAAA 1440 
CAAAGAATCA GCTAGCAAAC AGTCTTAGGA ATCGTGCAGG GGGAGAAATC CTTGAGCCAG 1500 
GGCTGCCATA TAACCTGACA GGAACATGCT ACTGAAGTTT ATTTTACCAT TC^CTGC 1560 
CCTCAATCTA GAACGCTACA CAAGAAATAT TTGTTTTACT CAGCAGGTGT GCCTTAACCT 1620 
CCCTATTCAG AAAGCTCCAC ATCAATAAAC ATGACACTCT GAAGTGAAAG TAGCCACGAG 1680 
AATTGTGCTA CTTATACTGG TTCATAATCT GGAGGCAAGG TTCGACTGCA GCCGCCCCGT 1740 
CAGCCTGTGC TAGGCATGGT GTCTTCACAG GAGGCAAATC CAGAGCCTGG CTGTGGGGAA 1800 
AGTGACCACT CTGCCCTGAC CCCGATCAGT TAAGGAGCTG TGCAATAACC TTCCTAGTAC 1860
CTGAGTGAGT GTGTAACTTA TTGGGTTGGC GAAGCCTGGT AAAGCTGTTG ™AGTAT 1920 
GTGATTCTTT TTAAGTATGA AAATAAAGAT ATATGTACAG ACTTGTATTT TTTCTCTGGT 1980 
GGCATTCCTT TAGGAATGCT GTGTGTCTGT CCGGCACCCC GGTAGGCCTG ATTGGGTTTC 2040 
^CCTCCT TAACCACTTA TCTCCCATAT GAGAGTGTGA AAAATAGGAA CACGTGCTCT 2100 
SSSSS AG^GGATTTGC TTGGGATACA GAAGAGGCCA TGTGTCTCAG AGCTGTTAAG 2160 
GGCTTATTTT TTTAAAACAT TGGAGTCATA GCATGTGTGT AAACTTTAAA TATGCAAATA 2220
AATAAGTATC TATGTCTAAA AAAAAAAAAA AAA 

Seq ID NO: 35 Protein Sequence
Protein Accession #: AAC63902.1

MDRSKENCIS GPVKATAPVG GPKRVLVTQQ IPCQNPLPVN SGQAQRVLCP SNSSQRVPLQ 60
AQKLVSSHKP VQNQKQKQLQ ATSVPHPVSR PLNNTQKSKQ PLPSAPENNP EEELASKQKN 120
EESKKRQWAL EDFEIGRPLG KGKFGNVYLA REKQSKFILA LKVLFKAQLE KAGVEHQLRR 180
EVEIQSHLRH PNILRLYGYF HDATRVYLIL EYAPLGTVYR ELQKLSKFDE QRTATYITEL 240
ANALSYCHSK RVIHRDIKPE NLLLGSAGEL KIADFGWSVH APSSRRTTLC GTLDYLPPEM 300
IEGRMHDEKV DLWSLGVLCY EFLVGKPPFE ANTYQETYKR ISRVEFTFPD FVTEGARDLI 360
SRLLKHNPSQ RPMLREVLEH PWITANSSKP SNCQNKESAS KQS

Seq ID NO: 36 DNA sequence
Nucleic Acid Accession #: NM_016267
Coding sequence: 67..843

1 11 21 31 41 51

| | | | | |

CGGCACGAGG ACAGAATCAG GTGATGGTCC AGAATTAAGA GCTGTCACCT GTGTCATTCA 60

CTCACAATGG AAGAAATGAA GAAGACTGCC ATCCGGCTGC CCAAAGGCAA ACAGAAGCCT 120 

ATAAAGACGG AATGGAATTC CCGGTGTGTC CTTTTCACCT ACTTCCAAGG GGACATCAGC 180 

AGCGTAGTGG ATGAACACTT CTCCAGAGCT CTGAGCAATA TCAAGAGCCC CCAGGAATTG 240 

ACCCCCTCGA GTCAGAGTGA AGGTGTGATG CTGAAAAACG ATGATAGCAT GTCTCCAAAT 300 

CAGTGGCGTT ACTCGTCTCC ATGGACAAAG CCACAACCAG AAGTACCTGT CACAAACCGT 360

GCCGCCAACT GCAACTTGCA TGTGCCTGGT CCCATGGCTG TGAATCAGTT CTCACCGTCC 420 

CTGGCTAGGA GGGCCTCTGT TCGGCCTGGG GAGCTGTGGC ATTTCTCCTC CCTGGCGGGC 480 

ACCAGCTCCT TAGAGCCTGG CTACTCTCAT CCCTTCCCCG CTCGGCACCT GGTTCCAGAG 540

CCCCAGCCTG ATGGGAAACG TGAGCCTCTC CTAAGTCTCC TCCAGCAAGA CAGATGCCTA 600 

GCCCGTCCTC AGGAATCTGC CGCCAGGGAG AATGGCAACC CTGGCCAGAT AGCTGGAAGC 660

ACAGGGTTGC TCTTCAACCT GCCTCCCGGC TCAGTTCACT ATAAGAAACT ATATGTATCT 720 

CGTGGATCTG CCAGTACCAG CCTTCCAAAT GAAACTCTTT CAGAGTTAGA GACACCTGGG 780 

AAATACTCAC TTACACCACC AAACCACTGG GGCCACCCAC ATCGATACCT GCAGCATCTT 840 

TAGTCAAGTT GGAGGAGAAA GACAACACTT GGTCTAAGAC ACGGCAGCAA GACATCCCTG 900 

CATATTGTTC CAGATAAAAA TGAAAGCTGC TCACACCCAC TTGCCTCCCC AATCTGTTAA 960

ACAGCTTCGT GTCTAGTATG AGCTCAGTAC TTGCCCTGTG AAAATCCCAG AAGCCCCCGC 1020 

TGTCAATGTT CCCCATCCAC ACCCTGCTTG CTCCTGTGTA ACAGCTCAGA TGATGAATAA 1080 
TAATAAAACT GTACTTTTTT GGAAAAAAAA AAAAAAAAAA AAAAAA 
Seq ID NO: 37 Protein Sequence
Protein Accession #: NP_057351.1 

1 11 21 31 41 51 

I I I I I I 

MEEMKKTAIR LPKGKQKPIK TEWNSRCVLF TYFQGDISSV VDEHFSRALS NIKSPQELTP 60

PSSQSEGVML KNDDSMSPNQ WRYSSPWTKP QPEVPVTNRA ANCNLHVPGP MAVNQFSPSL 120 

AARRASVRPG ELWHFSSLAG TSSLEPGYSH PFPARHLVPE PQPDGKREPL LSLLQQDRCL 180 

ARRPQESAAR ENGNPGQIAG STGLLFNLPP GSVHYKKLYV SRGSASTSLP NETLSELETP 240 
GKYYSLTPPN HWGHPHRYLQ HL

Seq ID NO: 38 DNA sequence 

Nucleic Acid Accession #: AK058088.1 

Coding sequence: 252.. 1772 



1 11 21 31 41 51

I I I I I I 

AGGAAACCAA GGCAAGCTCC CCCTGTCAAA GCACCTTGGC CCATAAGAAG AAAAGGGGGA 60 

GCCCCAGATG TGATGAGCGC TTCCAGGCTT CAGGCTCAGA AGGCGCCCCC AGCTCTCCTG 120 

TAACTCAGAG GCCAGTGTGA TGGGAGTTCC TCCACTCAGC ACACTTCCCC TGTAAACACG 180 

CCTGTGGTGG GCAAAAGGGC TTTGGAACGG TTGCTTGTCT TTTCTCTCCT GCGTAATTTC 240

CACTTTCATT CATGATAATG TCGAACACGC ACAAAGCTCG GCTGGAACGC CGGGTCACTG 300 

GCTCAACCAA CCGGTGGCGT TTGCCCAAAC AGCCTTTCTC TGGGGACCTG CTCTCACTTT 360 

CCCAGATGTG CAAGGCTCTG AGCATAGACT TTGAGGAAGC TTTGAGGAAC CCAGACAGGT 420 

TATGCATTTC ACAAATCCAG AAGTTTTTCT TTGAGAATTT CAAGAACAAG GACATCCAAA 480 

GTGGGGAAGC AGATGTGATT CTCGAGTGCC TGGGCTTCAA ATGGGAGCTC CATCAGCCCC 540

GGCTTTTTCA GTCTGAGACC TTGGCCAAGC TCTACCTGAA AGCCCTGGCG CAGGGCACCA 600 

CACACCCCCT GAGGGAGCTG GAGGAGCTTC TGCGAGCTCA ATCACCTAAG AAGACCAAAG 660 

AAAAATCCCC TGCAAAGAGG ATCATCATTT CCTTGAAGAT CAATGACCCA CTGGTCACTA 720 

AAGTCGCCTT CGCCACGGCC CTGAAGAACC TCTACATGAG TGAGGTGGAG ATTAACTTGG 780 

AAGACCTACT GGGAGTGCTG GCTTCCGCCC ACATCCTCCA GTTCAGTGGC CTGTTTCAAA 840

GGTGCGTGGA TGTGATGATA GCCAGACTCA AGCCAAGCAC CATCAAGAAA TTCTACGAGG 900 

CCGGCTGCAA GTACAAGGAA GAGCAGCTCA CCACCGGCTG CGAGAAGTGG CTGGAAATGA 960 

ACTTGGTTCC TCTAGGGGGG ACGCAGATCC ACCTCCACAA AATCCCACAG GACCTGCTCC 1020 

ACAAAGTGCT GAAGTCCCCC AGGTTATTTA CCTTTAGTGA ATTCCATCTT CTGAAAACAA 1080 

TGCTTTTGTG GGTCTTCTTG CAACTGAACT ACAAGATTCA GGCAATTCCG ACTTATGAAA 1140

CCGTGATGAC ATTTTTTAAG AGCTTTCCTG AGAACTGTTG CTTTCTGGAC CGGGACATAG 1200 

GACGGAGCTT GAGGCCGCTC TTCCTCTGCT TGCGTCTGCA CGGCATCACC AAAGGCAAGG 1260 

ATCTGGAGGT GCTGCGGCAC CTTAACTTCT TCCCAGAGTC ATGGCTCGAC CAGGTTACAG 1320 

TCAACCATTA CCACGCACTG GAGAATGGGG GCGACATGGT CCACCTGAAA GATCTTAACA 1380 

CCCAGGCTGT GAGATTTGGG CTGCTCTTTA ACCAGGAGAA TACAACTTAT TCGAAAACGA 1440

TTGCTCTATA TGGATTCTTC TTTAAGATAA AGGGACTCAA ACATGATACT ACCTCTTATA 1500 

GTTTTTACAT GCAGAGAATA AAGCACACAG ACCTGGAATC TCCCTCTGCG GTCTACGAGC 1560 

ACAACCACGT CAGCCTGCGA GCGGCACGCC TGGTGAAGTA TGAGATCAGA GCAGAGGCCC 1620 

TGGTTGACGG CAAGTGGCAG GAGTTCAGGA CAAACCAGAT CAAGCAGAAG TTTGGGTTGA 1680 

CCACGTCATC CTGCAAAAGC CATACCTTGA AAATCCAAAC TGTGGGCATC CCAATCTATG 1740

TAAGTTTTGC ATTCATCTTC CCAGCATCTT GACAGTTTCC AGAAGAATCT ATGGGATTTT 1800 
CCCCCCACTG GTCTGCATAA AAGAAAATAA AATGACATAA AAGGGAGC 



Seq ID NO: 39 Protein Sequence 
Protein Accession #: BAB71658.1

1 11 21 31 41 51

I I I I I I

MIMSNTHKAR LERRVTGSTN RWRLPKQPFS GDLLSLSQMC KALSIDFEEA LRNPDRLCIS 60
QIQKFFFENF KNKDIQSGEA DVILECLGFK WELHQPRLFQ SETLAKLYLK ALAQGTTHPL 120
RELEELLRAQ SPKKTKEKSP AKRIIISLKI NDPLVTKVAF ATALKNLYMS EVEINLEDLL 180
GVLASAHILQ FSGLFQRCVD VMIARLKPST IKKFYEAGCK YKEEQLTTGC EKWLEMNLVP 240
LGGTQIHLHK IPQDLLHKVL KSPRLFTFSE FHLLKTMLLW VFLQLNYKIQ AIPTYETVMT 300
FFKSFPENCC FLDRDIGRSL RPLFLCLRLH GITKGKDLEV LRHLNFFPES WLDQVTVNHY 360
HALENGGDMV HLKDLNTQAV RFGLLFNQEN TTYSKTIALY GFFFKIKGLK HDTTSYSFYM 420
QRIKHTDLES PSAVYEHNHV SLRAARLVKY EIRAEALVDG KWQEFRTNQI KQKFGLTTSS 480
CKSHTLKIQT VGIPIYVSFA FIFPAS

Seq ID NO: 40 DNA sequence

Nucleic Acid Accession #: NM_032899.1 

Coding sequence: 186.. 1070 



1 11 21 31 41 51

I I I I I I

GGCACGAGGA TTGCTCAGGA CAGCGGTAAA TCACTTCTTG GAGGTGCCCT GCACGCCGGT 60
CCTGGGAGCA GGCGGCCTCC CGGGGGTGCG GGAGCCCCAC TCCTCCGTGG TGTGTTCCAT 120
TTGCTTCCCA CATCTGGAGG AGCTGACGTG CCAGCCTCCC CCAGCACCAC CCAGGGACGG 180
GAGGCATGAG CCGGTCAAGG CACCTGGGCA AAATCCGGAA GCGTCTGGAA GATGTCAAGA 240
GCCAGTGGGT CCGGCCAGCC AGGGCTGAGT TTAGTGACAA CGAGAGTGCC CGGCTGGCCA 300
CGGACGCCCT CTTGGATGGG GGTTCTGAAG CCTACTGGCG GGTGCTCAGC CAGGAAGGCG 360
AGGTGGACTT CTTGTCCTCG GTGGAGGCCC AGTACATCCA GGCCCAGGCC AGGGAGCCCC 420
CGTGTCCCCC AGACACCCTG GGAGGGGCGG AAGCAGGCCC TAAGGGACTG GACTCCAGCT 480
CCCTACAGTC CGGCACCTAC TTCCCTGTGG CCTCAGAGGG CAGCGAGCCG GCCCTACTGC 540
ACAGCTGGGC CTCAGCTGAG AAGCCCTACC TGAAGGAAAA ATCCAGCGCC ACTGTGTACT 600
TCCAGACCGT CAAGCACAAC AACATCAGAG ACCTCGTCCG CCGCTGCATC ACCCGGACTA 660
GCCAGAACAT TTCCATCCGG AGTGTGGAAG GAGAGATATA CTGTGCCAAG TCAGGCAGGA 720
AATTCACTGG CCAAATCCGG GAGAAGTTCA TCATCTCGGA CTGGAGATTT GTCCTGTCTG 780
GATCTTACAG CTTCACCTGG CTCTGCGGAC ACGTGCACCG GAACATCCTC TCCAAGTTCA 840
CAGGCCAGGC GGTGGAGCTG TTTGACGAGG AGTTCCGCCA CCTCTACGCC TCCTCCAAGC 900
CTGTGATGGG CCTGAAGTCC CCGCGGCTGG TCGCCCCCGT CCCGCCCGGA GCAGCCCCGG 960
CCAATGGCCG CCTTAGCAGC AGCAGTGGCT CCGCCAGTGA CCGCACGTCC TCCAACCCCT 1020
TCAGCGGCCG CTCGGCAGGC AGCCACCCCG GTACCCGAAC TGACGGCTGA GATGAGGTTA 1080
GAATGACTGG GCCCGGCTGA ACATTCCAAA TTGGATTTCA CCATCTGCTG AGAAAGTTTA 1140
AGGAAGGCAA AGCTTGCCAG GTCACAGAAG CTCCCAAGCC CAGCTTTCCA AAGGCCTCAG 1200
CCTGTGCCTG TGTCGAGCTC AGTCCTGGGA GATAGGGGAG AACCTGCAGG CAGGAACAAG 1260
CCCCCCTACT CCTGACCACC CTCCATCAGC AGTCTCCCCT CCGTGGTCGT CTTTGTTGAC 1320
AAAGGTGCAG TTTCTCCTCT CCTGGGCACC TGTAACATGT GATGCGCTGC CTGCTGGGAG 1380
GTTAGGTCGG GGCTGCCCCG GCGAGTGGAG CATGAGCAGA ACCGCCGAGG GTCACTTCTG 1440
GGCAGAAGCT TTGAGAGCCT GGGTCCAGGT TGCCACATAG AAGCAGCTCT CCAGTTGAAA 1500
CCCTCCTCTG CCAGCCTGGG GTCCTAAGCG ATGAGCAGAA TCCCCCACTC CCACCCCACC 1560
AACCCACAAT GGATATGTAG TGAGCAAGAA ATAAACCTTT GTTGTTTAAA AAAAAAAAAA 1620
AAAAAAAA



Seq ID NO: 41 Protein Sequence 
Protein Accession #: NP_116288.1



1 11 21 31 41 51

I I I I I I

MSRSRHLGKI RKRLEDVKSQ WVRPARADFS DNESARLATD ALLDGGSEAY WRVLSQEGEV 60
DFLSSVEAQY IQAQAREPPC PPDTLGGAEA GPKGLDSSSL QSGTYFPVAS EGSEPALLHS 120
WASAEKPYLK EKSSATVYFQ TVKHNNIRDL VRRCITRTSQ NISIRSVEGE IYCAKSGRKF 180
TGQIREKFII SDWRFVLSGS YSFTWLCGHV HRNILSKFTG QAVELFDEEF RHLYASSKPV 240
MGLKSPRLVA PVPPGAAPAN GRLSSSSGSA SDRTSSNPFS GRSAGSHPGT RTDG



Seq ID NO: 42 DNA sequence 

Nucleic Acid Accession #: NM_000782.2

Coding sequence: 405.. 1946 



1 11 21 31 41 51

I I I I I I

TGGAGAGGGA CAGGAGGAAA CGCAGCGCCA GCAGCATCTC ATCTACCCTC CTTGACACCT 60
CCCCGTGGCT CCAGCAAGCC CTAGAGGTCA GCCTTGCGGA CCAACAGGAG GACTCCCAGC 120
TTTCCCTTTT CAAGAGGTAC CCCAGACACC GGCCACCCTC TTCCAGCCCC TGCGGCCAGT 180
GCAAGGAGGC ACCAATGCTC TGAGGCTGTC GCGTGGTGCA GCGTCGAGCA TCCTCGCCGA 240
GTCCTTCTGC TGCCTGTCCC GCCTCACCCC GCTCCATCAC ACCAGCTGGC CCTCTTTGCT 300
TCCTTTTCCC AGAATCGTTA AGCCCCGACT CCCACTAGCA CCTCGTACCA ACCTCGCCCC 360
ACCCCATCCT CCTGCCTTCC CGCGCTCCGG TGTCCCCCGC TGCCATGAGC TCCCCCATCA 420
GCAAGAGCCG CTCGCTTGCC GCCTTCCTGC AGCAGCTGCG CAGTCCGAGG CAGCCCCCGA 480
GACTGGTGAC ATCTACGGCG TACACGTCCC CTCAGCCGCG AGAGGTGCCA GTCTGCCCGC 540
TGACAGCTGG TGGCGAGACT CAGAACGCGG CCGCCCTGCC GGGCCCCACC AGCTGGCCAC 600
TGCTGGCGAG CCTGCTGCAG ATTCTCTGGA AAGGGGGTCT CAAGAAACAG CACGACACCC 660
TGGTGGAGTA CCACAAGAAG TATGGCAAGA TTTTCCGCAT GAAGTTGGGT TCCTTTGAGT 720
CGGTGCACCT GGGCTCGCCA TGCCTGCTGG AAGCGCTGTA CCGCACCGAG AGCGTACCCC 780
AGCGGCTGGA GATCAAACCG TGGAAGGCCT ATCGCGACTA CCGCAAAGAA GGCTACGGGC 840
TGCTGATCCT GGAAGGGGAA GACTGGCAGC GGGTCCGGAG TGCCTTTCAA AAGAAACTAA 900
TGAAACCAGG GGAAGTGATG AAGCTGGACA ACAAAATCAA TGAGGTCTTG GCCGATTTTA 960
TGGGCAGAAT AGATGAGCTC TGTGATGAAA GAGGCCACGT CGAAGACTTG TACAGCGAAC 1020
TGAACAAATG GTCGTTTGAA AGTATCTGCC TCGTGTTGTA TGAGAAGAGA TTTGGGCTTC 1080
TCCAGAAGAA TGCAGGGGAT GAAGCTGTGA ACTTCATCAT GGCCATCAAA ACAATGATGA 1140
GCACGTTTGG GAGGATGATG GTCACTCCAG TCGAGCTGCA CAAGAGCCTC AACACCAAGG 1200
TCTGGCAGGG ACACACTCTG GCCTGGGACA CCATTTTCAA ATCAGTCAAA GCTTGTATCG 1260
ACAACCGGTT AGAGAAGTAT TCTCAGCAGC CTAGTGCAGA TTTCCTTTGT GACATTTATC 1320
ACCAGAATCG GCTTTCAAAG AAAGAATTGT ATGCTGCTGT CACAGAGCTC CAGCTGGCTG 1380
CGGTGGAAAC GACAGGAAAC AGTCTAATGT GGATTCTCTA CAATTTATCC CGTAATCCCC 1440
AAGTGCAACA AAAGCTTCTT AAGGAAATTC AAAGTGTATT ACCTGAGAAT CAGAGGCCAC 1500
GGGAGGAAGA TTTGAGGAAT ATGCCGTATT TAAAAGCCTG TCTGAAAGAA TCTATGAGGC 1560
TTACCCCGGG TGTACCATTT ACAACTCGGA CTCTTGACAA GGCAACAGTT CTGGGTGAAT 1620
ATGCTTTACC CAAAGGAACA GTGCTCATGC TAAATACCCA GGTGTTGGGA TCCAGTGAAG 1680
ACAATTTTGA AGATTCAAGT CAGTTTAGAC CTGAACGTTG GCTTCAGGAG AAGGAAAAAA 1740
TTAATCCTTT TGCGCATCTT CCATTTGGCG TTGGAAAAAG AATGTGCATT GGTCGCCGAT 1800
TAGCAGAGCT TCAACTGCAT TTGGCTCTTT GTTGGATTGT CCGCAAATAC GACATCCAGG 1860
CCACAGACAA TGAGCCTGTT GAGATGCTAC ACTCAGGCAC CCTGGTGCCC AGCCGGGAAC 1920
TCCCCATCGC GTTTTGCCAG CGATAATACG CCTCAGATGG TGGTATTTGC TAACATCATA 1980
TCCAACTCAG GGAAGCGGAC TGAGTGCTGG GATCCAAGGC ATTCTACAGG GTTCACTGCT 2040
GGTTTACACT TCACCTGTGT CAGCACCATC TTCAGGTGCT TAGAATGGCC TGGGAGCCTG 2100
TTCTGTCTTG CATCTTCCAT GACATGAAAG GGAGGCTGGC ACTTGTCAGT CAGGTAGAGG 2160
TTACAAACCG TTTCAGGCCC TGCTACCACA TTCACTGTTT GAATCTTTAA TTCCCAAGAA 2220
TAAGTTTACA TTTCACAATG AATGACCTAC AACAGCTAAA TTTTCTGGGG CTGGGAGTAA 2280
TACTGACAAT CCATTTACTG TAGCTCTGCT TAATGTACTA CTTAGGAAAA TGTCCCTGCT 2340
TAATAATGTA AGCCAAGCTA AATGATGGTT AAAGTTATCA GGCCTCCCAT GAAATTGCGT 2400
TCTTCCTGCA TTGAAATAAA AACATTATTG GGAAACTAGA GAACACCTCT ATTTTTAAAA 2460
GGACTTTAAC GAAGTCAAAC AACTTCTAAG ACTAGTGATT CACTGGGGCA TTATTTGTTA 2520
GAGGACCTTA AAATTGTTTA TTTTTTAAAT GTGATTCCTT TATGGCATTA GGGTAAAGAT 2580
GAAGCAATAA TTTTTAAATT GTGTATGTGC ATATGAAGCA CAGACATGCA TGTGTGTGTG 2640
TGTCTGTGTG TGTGTGTCCG TGTATGTGTG TGTGGGTTCT AATGGTAATT TGCCTCAGTC 2700
ATTTTTTTAA TATTTGCAGT ACTTGATTTA GGATCTGTGG TGCAGGGCAT GTTTCAAAGT 2760
TTAGTCACAG CTTAAAAACA TTCAGTGTGA CTTTAATATT ATAAAATGAT TTCCCATGCC 2820
ATAATTTTTC TGTCTATTAA ATGGGACAAG TGTAAAGCAT GCAAAAGTTA GAGATCTGTT 2880
ATATAACATT TGTTTTGTGA TTTGAACTCC TAGGAAAAAT ATGATTTCAT AAATGTAAAA 2940
TGCACAGAAA TGCATGCAAT ACTTATAAGA CTTAAAAATT GTGTTTACAG ATGGTTTATT 3000
TGTGCATATT TTTACTACTG CTTTTCCTAA ATGCATACTG TATATAATTC TGTGTATTTG 3060
ATAAATATTT CTTCCTACAT TATATTTTTA GAATATTTCA GAAATATACA TTTATGTCTT 3120
TATATTGTAA TAAATATGTA CATATCTAGG TATATGCTTT CTCTCTGCTG TGAAATTATT 3180
TTTAGAATTA TAATTCACGT CTTGTCATAT TTCATCTGTA TACCTTCAAA TTCTCTGAAA 3240
GTAAAAATAA AAGTTTTTAA ATATT

Seq ID NO: 43 Protein Sequence
Protein Accession #: NP_000773.1

1 11 21 31 41 51

I I I I I I

MSSPISKSRS LAAFLQQLRS PRQPPRLVTS TAYTSPQPRE VPVCPLTAGG ETQNAAALPG 60
PTSWPLLASL LQILWKGGLK KQHDTLVEYH KKYGKIFRMK LGSFESVHLG SPCLLEALYR 120
TESVPQRLEI KPWKAYRDYR KEGYGLLILE GEDWQRVRSA FQKKLMKPGE VMKLDNKINE 180
VLADFMGRID ELCDERGHVE DLYSELNKWS FESICLVLYE KRFGLLQKNA GDEAVNFIMA 240
IKTMMSTFGR MMVTPVELHK SLNTKVWQGH TLAWDTIFKS VKACIDNRLE KYSQQPSADF 300
LCDIYHQNRL SKKELYAAVT ELQLAAVETT ANSLMWILYN LSRNPQVQQK LLKEIQSVLP 360
ENQRPREEDL RNMPYLKACL KESMRLTPGV PFTTRTLDKA TVLGEYALPK GTVLMLNTQV 420
LGSSEDNFED SSQFRPERWL QEKEKINPFA HLPFGVGKRM CIGRRLAELQ LHLALCWIVR 480
KYDIQATDNE PVEMLHSGTL VPSRELPIAF CQR

Seq ID NO: 44 DNA sequence

Nucleic Acid Accession #: NM_007000.2

Coding sequence: 1-777

1 11 21 31 41 51

I I I I I I

ATGGCGTCTG CGGCAGCAGC GGAGGCCGAG AAGGGATCTC CAGTTGTGGT GGGCCTGCTA 60 

GTTGTGGGCA ATATCATTAT TCTGCTGTCA GGCCTGTCCC TGTTTGCTGA GACCATATGG 120 

GTGACAGCCG ACCAGTACCG TGTATACCCA CTGATGGGAG TCTCAGGCAA GGATGACGTC 180 

70 TTCGCTGGTG CCTGGATTGC CATCTTCTGC GGCTTCTCCT TCTTCATGGT AGCCAGTTTT 240 

GGTGTGGGTG CCGCACTCTG CCGCCGCCGG TCCATGGTCC TCACGTACCT GGTGCTCATG 300 

CTCATCGTCT ACATCTTCGA GTGCGCCTCC TGCATCACGT CCTACACCCA CCGTGACTAC 360

ATGGTGTCCA ACCCATCCCT GATCACCAAG CAGATGCTGA CCTTCTACAG CGCGGACACC 420 

GACCAGGGCC AGGAGCTGAC CCGCCTCTGG GACCGCGTCA TGATTGAGCA AGAATGCTGT 480 

GGCACATCTG GTCCCATGGA CTGGGTGAAC TTCACGTCAG CCTTCCGGGC GGCCACTCCG 540 

GAGGTGGTGT TCCCCTGGCC CCCACTGTGC TGTCGCCGGA CGGGAAACTT CATCCCCCTC 600 

AACGAGGAGG GCTGCCGCCT GGGGCACATG GACTACCTGT TCACCAAGGG CTGCTTCGAA 660 

CACATCGGCC ACGCCATCGA CAGCTACACG TGGGGTATCT CGTGGTTTGG GTTTGCCATC 720 




CTGATGTGGA CGCTCCCGGT CATGCTGATA GCCATGTATT TCTACACCAT GCTCTGAGGG 780 

ACAGGAGGGG AAGGCAACAT ACACACCCCG GACTCCTCCG CATCCTCCTC CTGCTTCCTC 840 

CGCTGGGCCT GGATGGCTGC CTCACCTCTC ACCTCCCAAC GTCCCTAGCC CTTACGTCCT 900 

TCCACTTCCA AGATCTTTTT CCAGGTTCCT GAGCCCTACT GTGTCTCAGG TGTGCCCTGA 960 

AACCCCAGGG CTTGTGTGCA CATATCCTTA GCCCATCTTT CAAGGGACCT CTCCATGATC 1020 

CCACCTCCCA TTCACAGATA CCTCTCTTGT AGCTCTCTGA CCTCCTCCTT CATGGCAGGC 1080 

ATCGCCATTC TTGCTGAACC GTTTGTGATT GCCATTTGAG CTCTGGAAGC CTCTATTGCC 1140

ATGAGAGTTC TGTCACGGTC ACTTTACTGT CCCCATCATC ACCCAGCACG GGGCTAAGCA 1200 
TATACTAGAT AGTCAATAAA TAA 



Seq ID NO: 45 Protein sequence: 
Protein Accession #: NP_008931.1

1 11 21 31 41 51

I I I I I I

MASAAAAEAE KGSPVVVGLL VVGNIIILLS GLSLFAETIW VTADQYRVYP LMGVSGKDDV 60
FAGAWIAIFC GFSFFMVASF GVGAALCRRR SMVLTYLVLM LIVYIFECAS CITSYTHRDY 120
MVSNPSLITK QMLTFYSADT DQGQELTRLW DRVMIEQECC GTSGPMDWVN FTSAFRAATP 180
EVVFPWPPLC CRRTGNFIPL NEEGCRLGHM DYLFTKGCFE HIGHAIDSYT WGISWFGFAI 240
LMWTLPVMLI AMYFYTML



Seq ID NO: 46 DNA sequence 

Nucleic Acid Accession #: NM_006760.1 

Coding sequence: 39..593

1 11 21 31 41 51

I I I I I I

GAAAGCCTGC CAGCACCTAT TCCACCTCCC AGCCCAGCAT GGCACCCCTG CTGCCCATCC 60 

GGACCTTGCC CTTGATCCTG ATTCTGCTGG CTCTGCTGTC CCCAGGGGCT GCAGACTTCA 120 

ACATCTCAAG CCTCTCTGGT CTGCTGTCCC CGGCGCTAAC GGAGAGCCTG CTGGTTGCCT 180 

TGCCCCCCTG TCACCTCACA GGAGGCAATG CCACACTGAT GGTCCGGAGA GCCAATGACA 240 

GCAAAGTGGT GACGTCCAGC TTTGTGGTGC CTCCGTGCCG TGGGCGCAGG GAACTGGTGA 300 

GTGTGGTGGA CAGTGGTGCT GGCTTCACAG TCACTCGGCT CAGTGCATAC CAGGTGACAA 360 

ACCTCGTGCC AGGAACCAAA TTCTACATTT CCTACCTAGT GAAGAAGGGG ACAGCCACTG 420 

AGTCCAGCAG AGAGATCCCA ATGTCCACAC TCCCTCGAAG GAACATGGAA TCCATTGGGC 480 

TGGGTATGGC CCGCACAGGG GGCATGGTGG TCATCACGGT GCTGCTCTCT GTCGCCATGT 540 

TCCTGCTGGT GCTGGGCTTC ATCATTGCCC TGGCACTGGG CTCCCGCAAG TAAGGAGGTC 600 

TGCCCGGAGC AGCAGCTTCT CCAGGAAGCC CAGGGCACCA TCCAGCTCCC CAGCCCACCT 660 

GCTCCCAGGC CCCAGGCCTG TGGCTCCCTT GGTGCCCTCG CCTCCTCCTC CTGCCCTCCT 720 

CTCCCCTAGA GCCCTCTCCT CCCTCTGTCC CTCTCCTTGC CCCCAGTGCC TCACCTTCCA 780

ACACTCCATT ATTCCTCTCA CCCCACTCCT GTCAGAGTTG ACTTTCCTCC CATTTTACCA 840 

CTTTAAACAC CCCCATAACA ATTCCCCCAT CCTTCAGTGA ACTAAGTCCC TATAATAAAG 900 
GCTGAGGCTG CATCTGCCAA AAAAAAAAAA AA 



Seq ID NO: 47 Protein Sequence 
Protein Accession #: NP_006751.1

1 11 21 31 41 51

MAPLLPIRTL PLILILLALL SPGAADFNIS SLSGLLSPAL TESLLVALPP CHLTGGNATL 60

MVRRANDSKV VTSSFVVPPC RGRRELVSVV DSGAGFTVTR LSAYQVTNLV PGTKFYISYL 120

VKKGTATESS REIPMSTLPR RNMESIGLGM ARTGGMVVIT VLLSVAMFLL VLGFIIALAL 180
GSRK



Seq ID NO: 48 DNA sequence 

Nucleic Acid Accession #: EOS sequence

Coding sequence: 200-2932 

1 11 21 31 41 51

ATTGCTGATG GATCAGTGAG CCTGTGTTCA TGCCAGTGAG CTGCTGTGGC TCAGATACTG 60 

ATACTTTCTT TCCAAACAGC ATAAGAAGTG ATTGAGCCAC AAGTATACTG AAGGAAGGGC 120 

TCCCTCGAGT TCTGGTGTGA AGAGATAAAT CACCAGTCAC AGACTATGCA CCCGACTGCT 180 

GCTGTTCAGT CCAGGGAAAA TGAAAGTTGG AGTGCTGTGG CTCATTTCTT TCTTCACCTT 240 

CACTGACGGC CACGGTGGCT TCCTGGGGAA AAATGATGGC ATCAAAACAA AAAAAGAACT 300 

CATTGTGAAT AAGAAAAAAC ATCTAGGCCC AGTCGAAGAA TATCAGCTGC TGCTTCAGGT 360 

GACCTATAGA GATTCCAAGG AGAAAAGAGA TTTGAGAAAT TTTCTGAAGC TCTTGAAGCC 420 

TCCATTATTA TGGTCACATG GGCTAATTAG AATTATCAGA GCAAAGGCTA CCACAGACTG 480

CAACAGCCTG AATGGAGTCC TGCAGTGTAC CTGTGAAGAC AGCTACACCT GGTTTCCTCC 540 

CTCATGCCTT GATCCCCAGA ACTGCTACCT TCACACGGCT GGAGCACTCC CAAGCTGTGA 600 

ATGTCATCTC AACAACCTCA GCCAGAGTGT CAATTTCTGT GAGAGAACAA AGATTTGGGG 660 

CACTTTCAAA ATTAATGAAA GGTTTACAAA TGACCTTTTG AATTCATCTT CTGCTATATA 720 

CTCCAAATAT GCAAATGGAA TTGAAATTCA ACTTAAAAAA GCATATGAAA GAATTCAAGG 780 

TTTTCAGTCG GTTCAGGTCA CCCAATTTCG AAATGGAAGC ATCGTTGCTG GGTATGAAGT 840 

TGTTGGCTCC AGCAGTGCAT CTGAACTGCT GTCAGCCATT GAACATGTTG CCGAGAAGGC 900 

TAAGACAGCC CTTCACAAGC TGTTTCCATT AGAAGACGGC TCTTTCAGAG TGTTCGGAAA 960 

AGCCCAGTGT AATGACATTG TCTTTGGATT TGGGTCCAAG GATGATGAAT ATACCCTGCC 1020 

CTGCAGCAGT GGCTACAGGG GAAACATCAC AGCCAAGTGT GAGTCCTCTG GGTGGCAGGT 1080 

CATCAGGGAG ACTTGTGTGC TCTCTCTGCT TGAAGAACTG AACAAGAATT TCAGTATGAT 1140

TGTAGGCAAT GCCACTGAGG CAGCTGTGTC ATCCTTCGTG CAAAATCTTT CTGTCATCAT 1200 

TCGGCAAAAC CCATCAACCA CAGTGGGGAA TCTGGCTTCG GTGGTGTCGA TTCTGAGCAA 1260 

TATTTCATCT CTGTCACTGG CCAGCCATTT CAGGGTGTCC AATTCAACAA TGGAGGATGT 1320 

CATCAGTATA GCTGACAATA TCCTTAATTC AGCCTCAGTA ACCAACTGGA CAGTCTTACT 1380 

GCGGGAAGAA AAGTATGCCA GCTCACGGTT ACTAGAGACA TTAGAAAACA TCAGCACTCT 1440 

GGTGCCTCCG ACAGCTCTTC CTCTGAATTT TTCTCGGAAA TTCATTGACT GGAAAGGGAT 1500

TCCAGTGAAC AAAAGCCAAC TCAAAAGGGG TTACAGCTAT CAGATTAAAA TGTGTCCCCA 1560 

AAATACATCT ATTCCCATCA GAGGCCGTGT GTTAATTGGG TCAGACCAAT TCCAGAGATC 1620 

CCTTCCAGAA ACTATTATCA GCATGGCCTC GTTGACTCTG GGGAACATTC TACCCGTTTC 1680 

CAAAAATGGA AATGCTCAGG TCAATGGACC TGTGATATCC ACGGTTATTC AAAACTATTC 1740

CATAAATGAA GTTTTCCTAT TTTTTTCCAA GATAGAGTCA AACCTGAGCC AGCCTCATTG 1800 

TGTGTTTTGG GATTTCAGTC ATTTGCAGTG GAACGATGCA GGCTGCCACC TAGTGAATGA 1860 

AACTCAAGAC ATCGTGACGT GCCAATGTAC TCACTTGACC TCCTTCTCCA TATTGATGTC 1920 

ACCTTTTGTC CCCTCTACAA TCTTCCCCGT TGTAAAATGG ATCACCTATG TGGGACTGGG 1980 

TATCTCCATT GGAAGTCTCA TTTTATGCCT GATCATCGAG GCTTTGTTTT GGAAGCAGAT 2040 

TAAAAAAAGC CAAACCTCTC ACACACGTCG TATTTGCATG GTGAACATAG CCCTGTCCCT 2100 

CTTGATTGCT GATGTCTGGT TTATTGTTGG TGCCACAGTG GACACCACGG TGAACCCTTC 2160 

TGGAGTCTGC ACAGCTGCTG TGTTCTTTAC ACACTTCTTC TACCTCTCTT TGTTCTTCTG 2220 

GATGCTCATG CTTGGCATCC TGCTGGCTTA CCGGATCATC CTCGTGTTCC ATCACATGGC 2280 

CCAGCATTTG ATGATGGCTG TTGGATTTTG CCTGGGTTAT GGGTGCCCTC TCATTATATC 2340 

TGTCATTACC ATTGCTGTCA CGCAACCTAG CAATACCTAC AAAAGGAAAG ATGTGTGTTG 2400 

GCTTAACTGG TCCAATGGAA GCAAACCACT CCTGGCTTTT GTTGTCCCTG CACTGGCTAT 2460 

TGTGGCTGTG AACTTCGTTG TGGTGCTGCT AGTTCTCACA AAGCTCTGGA GGCCGACTGT 2520 

TGGGGAAAGA CTGAGTCGGG ATGACAAGGC CACCATCATC CGCGTGGGGA AGAGCCTCCT 2580 

CATTCTGACC CCTCTGCTAG GGCTCACCTG GGGCTTTGGA ATAGGAACAA TAGTGGACAG 2640 

CCAGAATCTG GCTTGGCATG TTATTTTTGC TTTACTCAAT GCATTCCAGG GATTTTTTAT 2700 

CTTATGCTTT GGAATACTCT TGGACAGTAA GCTGCGACAA CTTCTGTTCA ACAAGTTGTC 2760 

TGCCTTAAGT TCTTGGAAGC AAACAGAAAA GCAAAACTCA TCAGATTTAT CTGCCAAACC 2820 

CAAATTCTCA AAGCCTTTCA ACCCACTGCA AAACAAAGGC CATTATGCAT TTTCTCATAC 2880 

TGGAGATTCC TCCGACAACA TCATGCTAAC TCAGTTTGTC TCAAATGAAT AAGGCAAGGA 2940 

ATCATAAAAT CAAGAAAAAA TTTCCAGAAC AACTTGACAT TTAGAGACAA ATGTCAATGA 3000 

AGAAATTATG CTCAGTATTC GATCGGGTTT TCTGATTTAG GGGTCTGGGA ATAAAACAAG 3060 
AATGTCTCAG TGGCTTCA 

Seq ID NO: 49 Protein sequence: 
Protein Accession #: EOS sequence 

1 11 21 31 41 51 

I I I I I I

MKVGVLWLIS FFTFTDGHGG FLGKNDGIKT KKELIVNKKK HLGPVEEYQL LLQVTYRDSK 60

EKRDLRNFLK LLKPPLLWSH GLIRIIRAKA TTDCNSLNGV LQCTCEDSYT WFPPSCLDPQ 120

NCYLHTAGAL PSCECHLNNL SQSVNFCERT KIWGTFKINE RFTNDLLNSS SAIYSKYANG 180

IEIQLKKAYE RIQGFESVQV TQFRNGSIVA GYEVVGSSSA SELLSAIEHV AEKAKTALHK 240

LFPLEDGSFR VFGKAQCNDI VFGFGSKDDE YTLPCSSGYR GNITAKCESS GWQVIRETCV 300

LSLLEELNKN FSMIVGNATE AAVSSFVQNL SVIIRQNPST TVGNLASVVS ILSNISSLSL 360

ASHFRVSNST MEDVISIADN ILNSASVTNW TVLLREEKYA SSRLLETLEN ISTLVPPTAL 420

PLNFSRKFID WKGIPVNKSQ LKRGYSYQIK MCPQNTSIPI RGRVLIGSDQ FQRSLPETII 480

SMASLTLGNI LPVSKNGNAQ VNGPVISTVI QNYSINEVFL FFSKIESNLS QPHCVFWDFS 540

HLQWNDAGCH LVNETQDIVT CQCTHLTSFS ILMSPFVPST IFPVVKWITY VGLGISIGSL 600

ILCLIIEALF WKQIKKSQTS HTRRICMVNI ALSLLIADVW FIVGATVDTT VNPSGVCTAA 660

VFFTHFFYLS LFFWMLMLGI LLAYRIILVF HHMAQHLMMA VGFCLGYGCP LIISVITIAV 720

TQPSNTYKRK DVCWLNWSNG SKPLLAFVVP ALAIVAVNFV VVLLVLTKLW RPTVGERLSR 780

DDKATIIRVG KSLLILTPLL GLTWGFGIGT IVDSQNLAWH VIFALLNAFQ GFFILCFGIL 840

LDSKLRQLLF NKLSALSSWK QTEKQNSSDL SAKPKFSKPF NPLQNKGHYA FSHTGDSSDN 900
IMLTQFVSNE

Seq ID NO: 50 DNA sequence 

Nucleic Acid Accession #: EOS sequence 

Coding sequence: 63-3224 

1 11 21 31 41 51 

I I I I I I

CAGGAATTTG TGGCGGAGAG GGCAAATAAC TGCGGCTCTC CCGGCGCCCC GATGCTCGCA 60 

CCATGTCGAG GCGCAAGCAG GCGAAACCCC AGCACATCAA CTCGGAGGAG GACCAGGGCG 120 

AGCAGCAGCC GCAGCAGCAG ACCCCGGAGT TTGCAGATGC GGCCCCAGCG GCGCCCGCGG 180 

CGGGGGAGCT GGGTGCTCCA GTGAACCACC CAGGGAATGA CGAGGTGGCG AGTGAGGATG 240 

AAGCCACAGT AAAGCGGCTT CGTCGGGAGG AGACGCACGT CTGTGAGAAA TGCTGTGCGG 300 

AGTTCTTCAG CATCTCTGAG TTCCTGGAAC ATAAGAAAAA TTGCACTAAA AATCCACCTG 360 

TCCTCATCAT GAATGACAGC GAGGGGCCTG TGCCTTCAGA AGACTTCTCC GGAGCTGTAC 420 

TGAGCCACCA GCCCACCAGT CCCGGCAGTA AGGACTGTCA CAGGGAGAAT GGCGGCAGCT 480 

CAGAGGACAT GAAGGAGAAG CCGGATGCGG AGTCTGTGGT GTACCTAAAG ACAGAGACAG 540 

CCCTGCCACC CACCCCCCAG GACATAAGCT ATTTAGCCAA AGGCAAAGTG GCCAACACTA 600 

ATGTGACCTT GCAGGCACTA CGGGGCACCA AGGTGGCGGT GAATCAGCGG AGCGCGGATG 660 

CACTCCCTGC CCCCGTGCCT GGTGCCAACA GCATCCCGTG GGTCCTCGAG CAGATCTTGT 720 

GTCTGCAGCA GCAGCAGCTA CAGCAGATCC AGCTCACCGA GCAGATCCGC ATCCAGGTGA 780 

ACATGTGGGC CTCCCACGCC CTCCACTCAA GCGGGGCAGG GGCCGACACT CTGAAGACCT 840 

TGGGGAGCCA CATGTCTCAG CAGGTTTCTG CAGCTGTGGC TTTGCTCAGC CAGAAAGCTG 900 
GAAGCCAAGG TCTGTCTCTG GATGCCTTGA AACAAGCCAA GCTACCTCAC GCCAACATCC 960
CTTCTGCCAC CAGCTCCCTG TCCCCAGGGC TGGCACCCTT CACTCTGAAG CCGGATGGGA 1020
CCCGGGTGCT CCCGAACGTC ATGTCCCGCC TCCCGAGCGC TTTGCTTCCT CAGGCCCCGG 1080
GCTCGGTGCT CTTCCAGAGC CCTTTCTCCA CTGTGGCGCT AGACACATCC AAGAAAGGGA 1140
AGGGGAAGCC ACCGAACATC TCCGCGGTGG ATGTCAAACC CAAAGACGAG GCGGCCCTCT 1200
ACAAGCACAA GTGTAAGTAC TGTAGCAAGG TTTTTGGGAC TGATAGCTCC TTGCAGATCC 1260
ACCTCCGCTC CCACACTGGA GAGAGACCCT TCGTGTGCTC TGTCTGTGGT CATCGCTTCA 1320
CCACCAAGGG CAACCTCAAG GTGCACTTTC ACCGACATCC CCAGGTGAAG GCAAACCCCC 1380
AGCTGTTTGC CGAGTTCCAG GACAAAGTGG CGGCCGGCAA TGGCATCCCC TATGCACTCT 1440
CTGTACCTGA CCCCATAGAT GAACCGAGTC TTTCTTTAGA CAGCAAACCT GTCCTTGTAA 1500
CCACCTCTGT AGGGCTACCT CAGAATCTTT CTTCGGGGAC TAATCCCAAG GACCTCACGG 1560
GTGGCTCCTT GCCCGGTGAC CTGCAGCCTG GGCCTTCTCC AGAAAGTGAG GGTGGACCCA 1620
CACTCCCTGG GGTGGGACCA AACTATAATT CCCCAAGGGC TGGTGGCTTC CAAGGGAGTG 1680
GGACCCCTGA GCCAGGGTCA GAGACCCTGA AATTGCAGCA GTTGGTGGAG AACATTGACA 1740
AGGCCACCAC TGATCCCAAC GAATGTCTCA TTTGCCACCG AGTCTTAAGC TGTCAGAGCT 1800
CCCTCAAGAT GCATTATCGC ACCCACACCG GGGAGAGACC GTTCCAGTGT AAGATCTGTG 1860
GCCGAGCCTT TTCTACCAAA GGTAACCTGA AGACACACCT TGGGGTTCAC CGAACCAACA 1920
CATCCATTAA GACGCAGCAT TCGTGCCCCA TCTGCCAGAA GAAGTTCACT AATGCCGTGA 1980
TGCTGCAGCA ACATATTCGG ATGCACATGG GCGGTCAGAT TCCCAACACG CCCCTGCCAG 2040
AGAATCCCTG TGACTTTACG GGTTCTGAGC CAATGACCGT GGGTGAGAAC GGCAGCACCG 2100
GCGCTATCTG CCATGATGAT GTCATCGAAA GCATCGATGT AGAGGAAGTC AGCTCCCAGG 2160
AGGCTCCCAG CAGCTCCTCC AAGGTCCCCA CGCCTCTTCC CAGCATCCAC TCGGCATCAC 2220
CCACGCTAGG GTTTGCCATG ATGGCTTCCT TAGATGCCCC AGGGAAAGTG GGTCCTGCCC 2280
CTTTTAACCT GCAGCGCCAG GGCAGCAGAG AAAACGGTTC CGTGGAGAGC GATGGCTTGA 2340
CCAACGACTC ATCCTCGCTG ATGGGAGACC AGGAGTATCA GAGCCGAAGC CCAGATATCC 2400
TGGAAACCAC ATCCTTCCAG GCACTCTCCC CGGCCAATAG TCAAGCCGAA AGCATCAAGT 2460
CAAAGTCTCC CGATGCTGGG AGCAAAGCAG AGAGCTCCGA GAACAGCCGC ACTGAGATGG 2520
AAGGTCGGAG CAGTCTCCCT TCCACGTTTA TCCGAGCCCC GCCGACCTAT GTCAAGGTTG 2580
AAGTTCCTGG CACATTTGTG GGACCCTCGA CATTGTCCCC AGGGATGACC CCTTTGTTAG 2640
CAGCCCAGCC ACGCCGACAG GCCAAGCAAC ATGGCTGCAC ACGGTGTGGG AAGAACTTCT 2700
CGTCTGCTAG CGCTCTTCAG ATCCACGAGC GGACTCACAC TGGAGAGAAG CCTTTTGTGT 2760
GCAACATTTG TGGGCGAGCT TTTACCACCA AAGGCAACTT AAAGGTTCAC TACATGACAC 2820
ACGGGGCGAA CAATAACTCA GCCCGCCGTG GAAGGAAGTT GGCCATCGAG AACACCATGG 2880
CTCTGTTAGG TACGGACGGA AAAAGAGTCT CAGAAATCTT TCCCAAGGAA ATCCTGGCCC 2940
CTTCAGTGAA TGTGGACCCT GTTGTGTGGA ACCAGTACAC CAGCATGCTC AATGGCGGTC 3000
TGGCCGTGAA GACCAATGAG ATCTCTGTGA TCCAGAGTGG GGGGGTTCCT ACCCTCCCGG 3060
TTTCCTTGGG GGCCACCTCC GTTGTGAATA ACGCCACTGT CTCCAAGATG GATGGCTCCC 3120
AGTCGGGTAT CAGTGCAGAT GTGGAAAAAC CAAGTGCTAC TGACGGCGTT CCCAAACACC 3180
AGTTTCCTCA CTTCCTGGAA GAAAACAAGA TTGCGGTCAG CTAAGGGAGA ACTTGCGTGG 3240
AAGGAGCAAT GCAGACACAG TGAAATCTCT AGAATCTGCT TTGTTTTGTA AGAACTCATC 3300
TCCTCCTGTT TTCTTTTTCT TACTGATATG CAAATGATGT TTACTACGTT GGTTGTGACC 3360
ACAACCTCAG GCAAGTGCTA CAATCACGAT TGTTGCTATG CTGCTTTGCA AAAAGTTG



Seq ID NO: 51 Protein sequence:
Protein Accession #: NP_065169.1 



1 11 21 31 41 51

I I I I I I

MSRRKQAKPQ HINSEEDQGE QQPQQQTPEF ADAAPAAPAA GELGAPVNHP GNDEVASEDE 60
ATVKRLRREE THVCEKCCAE FFSISEFLEH KKNCTKNPPV LIMNDSEGPV PSEDFSGAVL 120
SHQPTSPGSK DCHRENGGSS EDMKEKPDAE SVVYLKTETA LPPTPQDISY LAKGKVANTN 180
VTLQALRGTK VAVNQRSADA LPAPVPGANS IPWVLEQILC LQQQQLQQIQ LTEQIRIQVN 240
MWASHALHSS GAGADTLKTL GSHMSQQVSA AVALLSQKAG SQGLSLDALK QAKLPHANIP 300
SATSSLSPGL APFTLKPDGT RVLPNVMSRL PSALLPQAPG SVLFQSPFST VALDTSKKGK 360
GKPPNISAVD VKPKDEAALY KHKCKYCSKV FGTDSSLQIH LRSHTGERPF VCSVCGHRFT 420
TKGKLKVHFH RHPQVKANPQ LFAEFQDKVA AGNGIPYALS VPDPIDEPSL SLDSKPVLVT 480
TSVGLPQNLS SGTNPKDLTG GSLPGDLQPG PSPESEGGPT LPGVGPNYNS PRAGGFQGSG 540
TPEPGSETLK LQQLVENIDK ATTDPNECLI CHRVLSCQSS LKMHYRTHTG ERPFQCKICG 600
RAFSTKGNLK THLGVHRTNT SIKTQHSCPI CQKKFTNAVM LQQHIRMHMG GQIPNTPLPE 660
NPCDFTGSEP MTVGENGSTG AICHDDVIES IDVEEVSSQE APSSSSKVPT PLPSIHSASP 720
TLGFAMMASL DAPGKVGPAP FNLQRQGSRE NGSVESDGLT NDSSSLMGDQ EYQSRSPDIL 780
ETTSFQALSP ANSQAESIKS KSPDAGSKAE SSENSRTEME GRSSLPSTFI RAPPTYVKVE 840
VPGTFVGPST LSPGMTPLLA AQPRRQAKQH GCTRCGKNFS SASALQIHER THTGEKPFVC 900
NICGRAFTTK GNLKVHYMTH GANNNSARRG RKLAIENTMA LLGTDGKRVS EIFPKEILAP 960
SVNVDPVVWN QYTSMLNGGL AVKTNEISVI QSGGVPTLPV SLGATSVVNN ATVSKMDGSQ 1020
SGISADVEKP SATDGVPKHQ FPHFLEENKI AVS

Seq ID NO: 52 DNA sequence

Nucleic Acid Accession #: CAT cluster

1 11 21 31 41 51

I I I I I I

TCTAGCATTA AAGGAAGAAA CCCCGTCTTG CAACGAAGGC CTCAAAGAGG TCCAAATATC 60
AACTTGCAGA CATAACAAGC AGAGTGTTTC TAAACTGCTC TAAGAAAAGA AAGGTTAAAC 120
TCTGTGAGTT GAAGGCACAC ATCACAAAGT AGTTTCTGAG AATGATTCTG TCTAGTTTTT 180
ATTTGAAGAT ATTTCCTTTT CTACTGTTGG CATCAAATCG CTTGAAATCT CCACTTGCAA 240
ATTCCACAAA AAGAGTGTTT CAAATCTGCT CTGTGTAAAG GGACGTTCCA CTCTGTGAGT 300
TGAATACACA CAACACAAGG AAGTTACTGA GAATTCTTCT GTCTAGCATG ATATGAAGAA 360



ATCCCGTTTC CAACGAAGGC CTCAATGAGG TCTATATATC CACTTGCAGA CTTTACAAAC 420 

AGAGTGTTTC CTAACTGCTC TATGAAAAGA AAGGTTAAAC TCTGTGGAGT TGAACGCACA 480 

CATCACAAAA GGAGTTTCTG AGAATCATTC TGTCTAGTTT TTTTAGGAAG ATATTTCCTT 540 
TTCTACCGTT GACTTCAAAG CGGCTGAAAT 

Seq ID NO: 53 DNA sequence 

Nucleic Acid Accession #: NM_001910.1

Coding sequence: 50..1240



1 11 21 31 41 51 

I I I I I I 

GGAGAGAAGA AAGGAGGGGG CAAGGGAGAA GCTGCTGGTC GGACTCACAA TGAAAACGCT 60 

CCTTCTTTTG CTGCTGGTGC TCCTGGAGCT GGGAGAGGCC CAAGGATCCC TTCACAGGGT 120 

GCCCCTCAGG AGGCATCCGT CCCTCAAGAA GAAGCTGCGG GCACGGAGCC AGCTCTCTGA 180 

GTTCTGGAAA TCCCATAATT TGGACATGAT CCAGTTCACC GAGTCCTGCT CAATGGACCA 240 

GAGTGCCAAG GAACCCCTCA TCAACTACTT GGATATGGAA TACTTCGGCA CTATCTCCAT 300

TGGCTCCCCA CCACAGAACT TCACTGTCAT CTTCGACACT GGCTCCTCCA ACCTCTGGGT 360 

CCCCTCTGTG TACTGCACTA GCCCAGCCTG CAAGACGCAC AGCAGGTTCC AGCCTTCCCA 420 

GTCCAGCACA TACAGCCAGC CAGGTCAATC TTTCTCCATT CAGTATGGAA CCGGGAGCTT 480 

GTCCGGGATC ATTGGAGCCG ACCAAGTCTC TGTGGAAGGA CTAACCGTGG TTGGCCAGCA 540 

GTTTGGAGAA AGTGTCACAG AGCCAGGCCA GACCTTTGTG GATGCAGAGT TTGATGGAAT 600

TCTGGGCCTG GGATACCCCT CCTTGGCTGT GGGAGGAGTG ACTCCAGTAT TTGACAACAT 660 

GATGGCTCAG AACCTGGTGG ACTTGCCGAT GTTTTCTGTC TACATGAGCA GTAACCCAGA 720

AGGTGGTGCG GGGAGCGAGC TGATTTTTGG AGGCTACGAC CACTCCCATT TCTCTGGGAG 780 

CCTGAATTGG GTCCCAGTCA CCAAGCAAGC TTACTGGCAG ATTGCACTGG ATAACATCCA 840 

GGTGGGAGGC ACTGTTATGT TCTGCTCCGA GGGCTGCCAG GCCATTGTGG ACACAGGGAC 900 

TTCCCTCATC ACTGGCCCTT CCGACAAGAT TAAGCAGCTG CAAAACGCCA TTGGGGCAGC 960 

CCCCGTGGAT GGAGAATATG CTGTGGAGTG TGCCAACCTT AACGTCATGC CGGATGTCAC 1020 

CTTCACCATT AACGGAGTCC CCTATACCCT CAGCCCAACT GCCTACACCC TACTGGACTT 1080 

CGTGGATGGA ATGCAGTTCT GCAGCAGTGG CTTTCAAGGA CTTGACATCC ACCCTCCAGC 1140

TGGGCCCCTC TGGATCCTGG GGGATGTCTT CATTCGACAG TTTTACTCAG TCTTTGACCG 1200 

TGGGAATAAC CGTGTGGGAC TGGCCCCAGC AGTCCCCTAA GGAGGGGCCT TGTGTCTGTG 1260 

CCTGCCTGTC TGACAGACCT TGAATATGTT AGGCTGGGGC ATTCTTTACA CCTACAAAAA 1320 

GTTATTTTCC AGAGAATGTA GCTGTTTCCA GGGTTGCAAC TTGAATTAAG ACCAAACAGA 1380 

ACATGAGAAT ACACACACAC ACACACATAT ACACACACAC ACACTTCACA CATACACACC 1440

ACTCCCACCA CCGTCATGAT GGAGGAATTA CGTTATACAT TCATATTTTG TATTGATTTT 1500 

TGATTATGAA AATCAAAAAT TTTCACATTT GATTATGAAA ATCTCCAAAC ATATGCACAA 1560 

GCAGAGATCA TGGTATAATA AATCCCTTTG CAACTCCACT CAGCCCTGAC AACCCATCCA 1620 

CACACGGCCA GGCCTGTTTA TCTACACTGC TGCCCACTCC TCTCTCCAGC TCCACATGCT 1680 

GTACCTGGAT CATTCTGAAG CAAATTCCGA GCATTACATC ATTTTGTCCA TAAATATTTC 1740

TAACATCCTT AAATATACAA TCGGAATTCA AGCATCTCCC ATTGTCCCAC AAATGTTTGG 1800 

CTGTTTTTGT AGTTGGATTG TTTGTATTAG GATTCAAGCA AGGCCCATAT ATTGCATTTA 1860 

TTTGAAATGT CTGTAAGTCT CTTTCCATCT ACAGAGTTTA GCACATTTGA ACGTTGCTGG 1920 

TTGAAATCCC GAGGTGTCAT TTGACATGGT TCTCTGAACT TATCTTTCCT ATAAAATGGT 1980 

AGTTAGATCT GGAGGTCTGA TTTTGTGGCA AAAATACTTC CTAGGTGGTG CTGGGTACTT 2040

CTTGTTGCAT CCTGTCAGGA GGCAGATAAT GCTGGTGCCT CTCTATTGGT AATGTTAAGA 2100 
CTGCTGGGTG GGTTTGGAGT TCTTGGCTTT AATCATTCAT TACAAAGTTC AGCATTTT 

Seq ID NO: 54 Protein Sequence 
Protein Accession #: NP_001901.1

1 11 21 31 41 51

I I I I I I

MKTLLLLLLV LLELGEAQGS LHRVPLRRHP SLKKKLRARS QLSEFWKSHN LDMIQFTESC 60

SMDQSAKEPL INYLDMEYFG TISIGSPPQN FTVIFDTGSS NLWVPSVYCT SPACKTHSRF 120

QPSQSSTYSQ PGQSFSIQYG TGSLSGIIGA DQVSVEGLTV VGQQFGESVT EPGQTFVDAE 180 

FDGILGLGYP SLAVGGVTPV FDNMMAQNLV DLPMFSVYMS SNPEGGAGSE LIFGGYDHSH 240 

FSGSLNWVPV TKQAYWQIAL DNIQVGGTVM FCSEGCQAIV DTGTSLITGP SDKIKQLQNA 300 

IGAAPVDGEY AVECANLNVM PDVTFTINGV PYTLSPTAYT LLDFVDGMQF CSSGFQGLDI 360 

HPPAGPLWIL GDVFIRQFYS VFDRGNNRVG LAPAVP



Seq ID NO: 55 DNA sequence 

Nucleic Acid Accession #: NM_018058.1 

Coding sequence: 319..1575



1 11 21 31 41 51 

I I I I I I 

TACGCGCTGC GGGACCGGCA GGGGAACGCC ATCGGGGTCA CAGCCTGCGA CATCGACGGG 60 

GACGGCCGGG AGGAGATCTA CTTCCTCAAC ACCAATAATG CCTTCTCGGG GGTGGCCACG 120 

TACACCGACA AGTTGTTCAA GTTCCGCAAT AACCGGTGGG AAGACATCCT GAGCGATGAG 180

GTCAACGTGG CCCGTGGTGT GGCCAGCCTC TTTGCCGGAC GCTCTGTGGC CTGTGTGGAC 240 
AGAAAGGGCT CTGGACGCTA CTCTATCTAC ATTGCCAATT ACGCCTACGG TAATGTGGGC 300

CCTGATGCCC TCATTGAAAT GGACCCTGAG GCCAGTGACC TCTCCCGGGG CATTCTGGCG 360 

CTCAGAGATG TGGCTGCTGA GGCTGGGGTC AGCAAATATA CAGGGGGCCG AGGCGTCAGC 420 

GTGGGCCCCA TCCTCAGCAG CAGTGCCTCG GATATCTTCT GCGACAATGA GAATGGGCCT 480

AACTTCCTTT TCCACAACCG GGGCGATGGC ACCTTTGTGG ACGCTGCGGC CAGTGCTGGT 540 

GTGGACGACC CCCACCAGCA TGGGCGAGGT GTCGCCCTGG CTGACTTCAA CCGTGATGGC 600 

AAAGTGGACA TCGTCTATGG CAACTGGAAT GGCCCCCACC GCCTCTATCT GCAAATGAGC 660 
ACCCATGGGA AGGTCCGCTT CCGGGACATC GCCTCACCCA AGTTCTCCAT GCCCTCCCCT 720

GTCCGCACGG TCATCACCGC CGACTTTGAC AATGACCAGG AGCTGGAGAT CTTCTTCAAC 780 

AACATTGCCT ACCGCAGCTC CTCAGCCAAC CGCCTCTTCC GCGTCATCCG TAGAGAGCAC 840 

GGAGACCCCC TCATCGAGGA GCTCAATCCC GGCGACGCCT TGGAGCCTGA GGGCCGGGGC 900 

ACAGGGGGTG TGGTGACCGA CTTCGACGGA GACGGGATGC TGGACCTCAT CTTGTCCCAT 960 

GGAGAGTCCA TGGCTCAGCC GCTGTCCGTC TTCCGGGGCA ATCAGGGCTT CAACAACAAC 1020 

TGGCTGCGAG TGGTGCCACG CACCCGGGTT GGGGCCTTTG CCAGGGGAGC TAAGGTCGTG 1080 

CTCTACACCA AGAAGAGTGG GGCCCACCTG AGGATCATCG ACGGGGGCTC AGGCTACCTG 1140 

TGTGAGATGG AGCCCGTGGC ACACTTTGGC CTGGGGAAGG ATGAAGCCAG CAGTGTGGAG 1200 

GTGACGTGGC CAGATGGCAA GATGGTGAGC CGGAACGTGG CCAGCGGGGA GATGAACTCA 1260 

GTGCTGGAGA TCCTCTACCC CCGGGATGAG GACACACTTC AGGACCCAGC CCCACTGGAG 1320 

ACACCAATGA ATGCATCCAG TTCCCATTCG TGTGCCCTCG AGACAAGCCC GTATGTGTCA 1380 

ACACCTATGG AAGCTACAGG TGCCGGACCA ACAAGAAGTG CAGTCGGGGC TACGAGCCCA 1440 

ACGAGGATGG CACAGCCTGC GTGGGGACTC TCGGCCAGTC ACCGGGCCCC CGCCCCACCA 1500

CCCCCACCGC TGCTGCTGCC ACTGCCGCTG CTGCTGCCGC TGCTGGAGCT GCCACTGCTG 1560 

CACCGGTCCT CGTAGATGGA GATCTCAATC TGGGGTCGGT GGTTAAGGAG AGCTGCGAGC 1620 

CCAGCTGCTG AGCAGGGGTG GGACATGAAC CAGCGGATGG AGTCCAGCAG GGGAGTGGGA 1680 

AAGTGGGCTT GTGCTGCTGC CTAGACAGTA GGGATGTAAA GGCCTGGGAG CTAGACCCTC 1740 

CCCAAGCCCA TCCATGCACA TTACTTAGCT AACAATTAGG GAGACTCGTA AGGCCAGGCC 1800 

CTGTGCTGGG CACATAGCTG TGATCACAGC AGACAGGGTC GCTGCCCTGA TGGCGCTTAC 1860

ATTCCAGTGG GTCTAATGAC CATATCTTAG GACACAGATG TGCCCAGGGA GGTGGTGTCA 1920 

CTGCACAGGA AGTATGAGGA CTTTAGTGTC CTGAGTTCAA ATCCTGATTC AGGAACTCAC 1980 

AAAGCTATGT GACCTTACAC CAGTCACTTA ACTTGTTAGC CATCCATTAT CGCATCTGCA 2040 

AAATGGGGAT TAAGAATAGA ATCTTGGGGT TAGTGTGGAG ATTAGATTAA ATGTATGTAA 2100 

GACACTTGGC ACAAAACCTG GCACATAGTA AAGGCTCAAT AAAAACAAGT GCCTCTCACT 2160 
GGGCTTTGTC AACACGTG 



Seq ID NO: 56 Protein Sequence
Protein Accession #: NP_060528.1

1 11 21 31 41 51 

MDPEASDLSR GILALRDVAA EAGVSKYTGG RGVSVGPILS SSASDIFCDN ENGPNFLFHN 60

RGDGTFVDAA ASAGVDDPHQ HGRGVALADF NRDGKVDIVY GNWNGPHRLY LQMSTHGKVR 120

FRDIASPKFS MPSPVRTVIT ADFDNDQELE IFFNNIAYRS SSANRLFRVI RREHGDPLIE 180

ELNPGDALEP EGRGTGGVVT DFDGDGMLDL ILSHGESMAQ PLSVFRGNQG FNNNWLRVVP 240

RTRVGAFARG AKVVLYTKKS GAHLRIIDGG SGYLCEMEPV AHFGLGKDEA SSVEVTWPDG 300

KMVSRNVASG EMNSVLEILY PRDEDTLQDP APLETPMNAS SSHSCALETS PYVSTPMEAT 360
GAGPTRSAVG ATSPTRMAQP AWGLSASHRA PAPPPPPLLL PLPLLLPLLE LPLLHRSS



Seq ID NO: 57 DNA sequence 

Nucleic Acid Accession #: AJ279016.1 

Coding sequence: 1..1962 



1 11 21 31 41 51

ATGTCCAGGA TGTTACCGTT CCTGCTGCTG CTCTGGTTTC TGCCCATCAC TGAGGGGTCC 60 

CAGCGGGCTG AACCCATGTT CACTGCAGTC ACCAACTCAG TTCTGCCTCC TGACTATGAC 120 

AGTAATCCCA CCCAGCTCAA CTATGGTGTG GCAGTTACTG ATGTGGACCA TGATGGGGAC 180 

TTTGAGATCG TCGTGGCGGG GTACAATGGA CCCAACCTGG TTCTGAAGTA TGACCGGGCC 240 

CAGAAGCGGC TGGTGAACAT CGCGGTCGAT GAGCGCAGCT CACCCTACTA CGCGCTGCGG 300 

GACCGGCAGG GGAACGCCAT CGGGGTCACA GCCTGCGACA TCGACGGGGA CGGCCGGGAG 360 

GAGATCTACT TCCTCAACAC CAATAATGCC TTCTCGGGGG TGGCCACGTA CACCGACAAG 420 

TTGTTCAAGT TCCGCAATAA CCGGTGGGAA GACATCCTGA GCGATGAGGT CAACGTGGCC 480

CGTGGTGTGG CCAGCCTCTT TGCCGGACGC TCTGTGGCCT GTGTGGACAG AAAGGGCTCT 540 

GGACGCTACT CTATCTACAT TGCCAATTAC GCCTACGGTA ATGTGGGCCC TGATGCCCTC 600 

ATTGAAATGG ACCCTGAGGC CAGTGACCTC TCCCGGGGCA TTCTGGCGCT CAGAGATGTG 660 

GCTGCTGAGG CTGGGGTCAG CAAATATACA GGGGGCCGAG GCGTCAGCGT GGGCCCCATC 720

CTCAGCAGCA GTGCCTCGGA TATCTTCTGC GACAATGAGA ATGGGCCTAA CTTCCTTTTC 780 

CACAACCGGG GCGATGGCAC CTTTGTGGAC GCTGCGGCCA GTGCTGGTGT GGACGACCCC 840 

CACCAGCATG GGCGAGGTGT CGCCCTGGCT GACTTCAACC GTGATGGCAA AGTGGACATC 900 

GTCTATGGCA ACTGGAATGG CCCCCACCGC CTCTATCTGC AAATGAGCAC CCATGGGAAG 960 

GTCCGCTTCC GGGACATCGC CTCACCCAAG TTCTCCATGC CCTCCCCTGT CCGCACGGTC 1020

ATCACCGCCG ACTTTGACAA TGACCAGGAG CTGGAGATCT TCTTCAACAA CATTGCCTAC 1080 

CGCAGCTCCT CAGCCAACCG CCTCTTCCGC GTCATCCGTA GAGAGCACGG AGACCCCCTC 1140 

ATCGAGGAGC TCAATCCCGG CGACGCCTTG GAGCCTGAGG GCCGGGGCAC AGGGGGTGTG 1200 

GTGACCGACT TCGACGGAGA CGGGATGCTG GACCTCATCT TGTCCCATGG AGAGTCCATG 1260 

GCTCAGCCGC TGTCCGTCTT CCGGGGCAAT CAGGGCTTCA ACAACAACTG GCTGCGAGTG 1320 

GTGCCACGCA CCCGGTTTGG GGCCTTTGCC AGGGGAGCTA AGGTCGTGCT CTACACCAAG 1380

AAGAGTGGGG CCCACCTGAG GATCATCGAC GGGGGCTCAG GCTACCTGTG TGAGATGGAG 1440 

CCCGTGGCAC ACTTTGGCCT GGGGAAGGAT GAAGCCAGCA GTGTGGAGGT GACGTGGCCA 1500 

GATGGCAAGA TGGTGAGCCG GAACGTGGCC AGCGGGGAGA TGAACTCAGT GCTGGAGATC 1560 

CTCTACCCCC GGGATGAGGA CACACTTCAG GACCCAGCCC CACTGGAGTG TGGCCAAGGA 1620 

TTCTCCCAGC AGGAAAATGG CCATTGCATG GACACCAATG AATGCATCCA GTTCCCATTC 1680

GTGTGCCCTC GAGACAAGCC CGTATGTGTC AACACCTATG GAAGCTACAG GTGCCGGACC 1740 

AACAAGAAGT GCAGTCGGGG CTACGAGCCC AACGAGGATG GCACAGCCTG CGTGGGGACT 1800 

CTCGGCCAGT CACCGGGCCC CCGCCCCACC ACCCCCACCG CTGCTGCTGC CACTGCCGCT 1860

GCTGCTGCCG CTGCTGGAGC TGCCACTGCT GCACCGGTCC TCGTAGATGG AGATCTCAAT 1920

CTGGGGTCGG TGGTTAAGGA GAGCTGCGAG CCCAGCTGCT GAGCAGGGGT GGGACATGAA 1980 

CCAGCGGATG GAGTCCAGCA GGGGAGTGGG AAAGTGGGCT TGTGCTGCTG CCTAGACAGT 2040 

AGGGATGTAA AGGCCTGGGA GCTAGACCCT CCCCAAGCCC ATCCATGCAC ATTACTTAGC 2100 

TAACAATTAG GGAGACTCGT AAGGCCAGGC CCTGTGCTGG GCACATAGCT GTGATCACAG 2160 

CAGACAGGGT CGCTGCCCTG ATGGCGCTTA CATTCCAGTG GGTCTAATGA CCATATCTTA 2220

GGACACAGAT GTGCCCAGGG AGGTGGTGTC ACTGCACAGG AAGTATGAGG ACTTTAGTGT 2280 

CCTGAGTTCA AATCCTGATT CAGGAACTCA CAAAGCTATG TGACCTTACA CCAGTCACTT 2340 

AACTTGTTAG CCATCCATTA TCGCATCTGC AAAATGGGGA TTAAGAATAG AATCTTGGGG 2400 

TTAGTGTGGA GATTAGATTA AATGTATGTA AGACACTTGG CACAAAACCT GGCACATAGT 2460 
AAAGGCTCAA TAAAAACAAG TGCCTCTCAC TGGGCTTTGT CAACACG



Seq ID NO: 58 Protein Sequence
Protein Accession #: CAC08451.1 



1 11 21 31 41 51 

I I I I I I 

MSRMLPFLLL LWFLPITEGS QRAEPMFTAV TNSVLPPDYD SNPTQLNYGV AVTDVDHDGD 60

FEIVVAGYNG PNLVLKYDRA QKRLVNIAVD ERSSPYYALR DRQGNAIGVT ACDIDGDGRE 120

EIYFLNTNNA FSGVATYTDK LFKFRNNRWE DILSDEVNVA RGVASLFAGR SVACVDRKGS 180

GRYSIYIANY AYGNVGPDAL IEMDPEASDL SRGILALRDV AAEAGVSKYT GGRGVSVGPI 240

LSSSASDIFC DNENGPNFLF HNRGDGTFVD AAASAGVDDP HQHGRGVALA DFNRDGKVDI 300

VYGNWNGPHR LYLQMSTHGK VRFRDIASPK FSMPSPVRTV ITADFDNDQE LEIFFNNIAY 360

RSSSANRLFR VIRREHGDPL IEELNPGDAL EPEGRGTGGV VTDFDGDGML DLILSHGESM 420

AQPLSVFRGN QGFNNNWLRV VPRTRFGAFA RGAKVVLYTK KSGAHLRIID GGSGYLCEME 480

PVAHFGLGKD EASSVEVTWP DGKMVSRNVA SGEMNSVLEI LYPRDEDTLQ DPAPLECGQG 540

FSQQENGHCM DTNECIQFPF VCPRDKPVCV NTYGSYRCRT NKKCSRGYEP NEDGTACVGT 600
LGQSPGPRPT TPTAAAATAA AAAAAGAATA APVLVDGDLN LGSVVKESCE PSC

Seq ID NO: 59 DNA sequence 
Nucleic Acid Accession #: FGENESH
Coding sequence: 1..4794 

1 11 21 31 41 51 

I I I I I I 

ATGGCGTGTC CGGGAGGACT CCCAGCCCGT TGCTCTGGTT GGATGGGACT GGGTGGGCCC 60 

AGCGGCTCCT CCCCAGCATC CCCTCCCCAT TCCTCCTCCA GGTACAATGG ACCCAACCTG 120 

GTTCTGAAGT ATGACCGGGC CCAGAAGCGG CTGGTGAACA TCGCGGTCGA TGAGCGCAGC 180 

TCACCCTACT ACGCGCTGCG GGACCGGCAG GGGAACGCCA TCGGGGTCAC AGCCTGCGAC 240 

ATCGACGGGG ACGGCCGGGA GGAGATCTAC TTCCTCAACA CCAATAATGC CTTCTCGGGC 300 

CACAGCAGCT CAGCGCAGGT CCCTTCTGGG CTCCACAGAA ACAGGCCTGT GCTGAAGCCT 360

CCACCTACAA CCCCTGCAGG CCTCCTGGGT CTGCCTCCAC TCAGCGGAAG GGACTTTTCC 420 

TCCTCCCTGG GTCAGGCTTC TCCGGACAGC AGGCAGGGAG AGAGGGTGCC GGTTCCCTGC 480 

TGTCGGGGTG GACTGAGACC TACCCATGAA CCAGAACCAT TTCTTCTGAG ACCCAAATCA 540 

GGGGTGGCCA CGTACACCGA CAAGTTGTTC AAGTTCCGCA ATAACCGGTG GGAAGACATC 600 

CTGAGCGATG AGGTCAACGT GGCCCGTGGT GTGGCCAGCC TCTTTGCCGG ACGCTCTGTG 660

GCCTGTGTGG ACAGAAAGGG CTCTGGACGC TACTCTATCT ACATTGCCAA TTACGCCTAC 720 

GGTAATGTGG GCCCTGATGC CCTCATTGAA ATGGACCCTG AGGCCAGTGA CCTCTCCCGG 780 

GGCATTCTGG CGCTCAGAGA TGTGGCTGCT GAGGCTGGGG TCAGCAAATA TACAGAAGGC 840 

TTCTCCCACA CTGCCTCTCC AAGCATTGGT GAGATATCTG GCAGAACCGA GGAGCGGGAA 900 

GGAGGAGACC CAGAGGAGGC AGATGAGGAG CACAGTGGGG ATGGAAGCAC CAGCCAACTG 960

TGCCGGCTGG GCTGGAAGGA CGGGCAGTTC AAGGAAGAAG CAGCAGCTTT GGTGGAGGAA 1020 

CAGAGGGAGG CTGGGGCAGC TGGCGTGCCC AGAGGACGTG TTCGAACAGC TCTGCAGACT 1080 

TCCAAAAGCC ATTTGGCTGA CAAGAACCTA TTTGGCCCAC CATGTTACTA TTCTGTCTGC 1140 

GCGCCTTCTC CAGCCCACCC TTTCCCTGCC CGCCAAGCCC CCCAACACTA CCCTGTAGCC 1200 

CCCCTTGTCA CTCAGCTAAT GACACATGGA CGTCTGGCTG GAAAACTAGC CCGGAGTGTC 1260

CCCCACCCCC GAGCCCCAGG AATGGACCCC AAATGTAAGG GCCGCCATGC TGAGCCCGGC 1320 

CTGATGGCTG AGGCTTTGGG CGCGTGGCCA GCGCTCAGCA CCACTGTGGT GCCAGGGGGC 1380 

CTGAGAAGCT GGGAGGAAAG CAGGCAGAAG GGGCAGGCCA TGTCCAGATG TGCACTCAGG 1440 

GAGCTGGGAG GTCCCTGGAG CCAAGCCACA CAGCACCTGC CTGCTAGAGA GCTGTATGAC 1500 

CTGGGAGAAC CTCCCATTTT ACAAAGAACA GACGGAGATC CAGGGAGGAG AAGGGACTCG 1560

CCCAAGGTCA CACAGGAGTG CCATCTAGTG GCCACCATGC CAGCTCTCGG GGGACTCGAG 1620 

GGCCCCGGGA GGGTGGCCAA GCGAGAGATT GGGAGAGAGA CTGGGGCAGT AGGAAGACCA 1680 

CTCTCCCATC CCCTGGTCCC CAACTTCCCC AGCTGCTTGA GGCCTCTTGA AGCCGGGACA 1740 

GTGCCGGGAG CTGCCCTGCC TGGGAATCCT GGGAACTGGG TTCTGGACAT GGCCAAGGCC 1800 

CTGGCGTGGA ACCAGATGGA AAAAGAGGAG GGGAAGATTC ATGGAGACCA TGAGCCCAGA 1860

TTTAGGCTCA GGAAAGCACG GGAAGCAGAA TTCCCCCCAG GCTCCTCTGA GGAGCCTCTG 1920 

CTGCAGTTCC CCTCAGGCCT CAGAGGCAGC CCTGTCCTCC AGGTGGGCCT GGGGCTTGCT 1980

TCTGCCACTC ACTGTGGGTC GATGTCTTTT CTAGGGGGCC GAGGCGTCAG CGTGGGCCCC 2040 

ATCCTCAGCA GCAGTGCCTC GGATATCTTC TGCGACAATG AGAATGGGCC TAACTTCCTT 2100 

TTCCACAACC GGGGCGATGG CACCTTTGTG GACGCTGCGG CCAGTGCTGA ACGTCGTTTA 2160

GCCTTCATCG TTCACCTCAA ATATCACCTC TGCAGAGATT TTCCTCACTC CCTGTGCCAC 2220 

CTAGCAGAAA CTGGTCCTTC CTCCTCCTGC TGCCCGTGGC ATGCACGTCT TCTTCAGGCT 2280 

CCACATTGCC ATCATGGTTT GTCTATGAGC TTTACAAGGA CCGGGTCACG GTTCTATTCA 2340

TTCTTGACGC AAGGCTTGGC CTCCAGTGCC CACCGGAGGA CACTCAGCCT CCAGGGTTCT 2400 

CAGGGGGCCC CACCCTGCCT TCTGGCAAGA GCTCCCTGTG TCCTGGGGTC TCTGATCCCC 2460

ACTGCCTATT ACATTGTCCT GTGGTCTGCC ATCCCAGAGA GCCTGATGAC CCACAGCTAT 2520 

TTGTCCTCTG AAAGAGTCAA CGTGGGTGTG GACGACCCCC ACCAGCATGG GCGAGGTGTC 2580

GCCCTGGCTG ACTTCAACCG TGATGGCAAA GTGGACATCG TCTATGGCAA CTGGAATGGC 2640

CCCCACCGCC TCTATCTGCA AATGAGCACC CATGGGAAGG TCCGCTTCCG GGACATCGCC 2700

TCACCCAAGT TCTCCATGCC CTCCCCTGTC CGCACGGTCA TCACCGCCGA CTTTGACAAT 2760 

GACCAGGAGC TGGAGATCTT CTTCAACAAC ATTGCCTACC GCAGCTCCTC AGCCAACCGC 2820 

CTCTTCCGAT GCTCCATCCT GGCTCGTGGC TCTTCATCCT TGACAGCTGG TGGGAGGAAC 2880 

GGTCAGGGAG AAGGTTTAAG AATCAGAAGG GGAGGGTTCC CAGGGCCAGG GGGTCAGGCC 2940 

AAGGTCAACA CAGGTCCCCT GATGAAGAAA CAGAAAGGAA GGAAGGACGA GGACTGGGCA 3000 

AGAGGCTGTG GGAATGCAGG GCAAAGCCTG GCCAAGGAGC CGGCCTCTGC TATTGCAGGG 3060 

AAAGGGAAGG GAAATGTGGC CCAAAGTGTG CCCAGAACCC AAGCGCCACA AGATACAAAG 3120

CCACACTACC ACAAAAAGGG GCTACAGGGT CCAATCACTA CCAGGAAAAG GGGCTACGGG 3180 

GTCCAATCAC TACCAGGAAA AGGGGCTACG GGGTCCAATC ACTACCAGGA AAAGGGGCTA 3240 

CGGGGTCCAA TCACTACCAG GAAAAGGGGC TACGGGGTCC AATCACTACC AGGAAAAGGG 3300 

GCTACGGGCT CCAATCACTA CCAGGAAAAG GGGCTACAGG GTCCAATCAC TACCAGGAAA 3360 

AGGGGCTACG GGCTCCAATC ACTACCAGGA AAAGGGGCTA CAGGGTCCAA TCACTACCAC 3420 

AGAAAGGGGC TACGGGCTCC AATCACTACC AGGAAAAGGG GCTACGGGGT CCAATCACTA 3480 

CCAGGAAAAG GGGCTACAGG GTCCAATCAC TACCAGGAAA AGGGGCTACG GGGTCCAATC 3540 

ACTACCAGGA AAAGGGGCTA CGGGCTCCAA TCACTACCAG GAAAAGGGGC TACGGGGTCC 3600 

AATCACTACC AGGAAAAGGG GCTACAGGGT CCAATCACTA CCAGGAAAAG GGGCTACAGG 3660 

GTCCAATCAC TACCACAGAA AGGGGCTACG GGCTCCAATC ACTACCAGGA AAAGGGGCTA 3720 

CGGGGTCCAA TCACTACCAG GAAAAGGGGC TACGGGCTCC AATCACTACC AGGAAAAGAG 3780 

GCTATGGGGT CCAATCACTA CCAGGAAAAG GGGCTACGGG CTCCAATCAC TACCAGGAAA 3840 

AGGGGCTATG GGGTCCAATC ACTACCACAG AAAGGGGCTA CGGGGTCCAA CGTCATCCGT 3900 

AGAGAGCACG GAGACCCCCT CATCGAGGAG CTCAATCCCG GCGACGCCTT GGAGCCTGAG 3960 

GGCCGGGGCA CAGGGGGTGT GGTGACCGAC TTCGACGGAG ACGGGATGCT GGACCTCATC 4020 

TTGTCCCATG GAGAGTCCAT GGCTCAGCCG CTGTCCGTCT TCCGGGGCAA TCAGGGCTTC 4080 

AACAACAACT GGCTGCGAGT GGTGCCACGC ACCCGGTTTG GGGCCTTTGC CAGGGGAGCT 4140 

AAGGTCGTGC TCTACACCAA GAAGAGTGGG GCCCACCTGA GGATCATCGA CGGGGGCTCA 4200 

GGCTACCTGT GTGAGATGGA GCCCGTGGCA CACTTTGGCC TGGGGAAGGA TGAAGCCAGC 4260 

AGTGTGGAGG TGACGTGGCC AGATGGCAAG ATGGTGAGCC GGAACGTGGC CAGCGGGGAG 4320

ATGAACTCAG TGCTGGAGAT CCTCTACCCC CGGGATGAGG ACACACTTCA GGACCCAGCC 4380 

CCACTGGAGT GTGGCCAAGG ATTCTCCCAG CAGGAAAATG GCCATTGCAT GGACACCAAT 4440 

GAATGCATCC AGTTCCCATT CGTGTGCCCT CGAGACAAGC CCGTATGTGT CAACACCTAT 4500 

GGAAGCTACA GGTGCCGGAC CAACAAGAAG TGCAGTCGGG GCTACGAGCC CAACGAGGAT 4560 

GGCACAGCCT GCGTGGGTAC TGAGCTAGGC TCTAGGCATA CAATGACGTG GAAACCAAGG 4620 

CCCAAAAAGG AGCTGCAACT TTCCCAAGGC ATCTGCACCC CCGTCTGGTC CTTTTTCCTG 4680 

CCGGGTTGCC GGCTGCTCCT CAAAAGAGCT CAGCTCCAGG CTGCTCCCAG CACCCTTCTC 4740 
CAGAAAGCTC CAGGTATTCC AGAAGCCCAA GTGTATGAAC AAGATCAGGA ATAA 



Seq ID NO: 60 Protein Sequence 
Protein Accession #: FGENESH



1 11 21 31 41 51

| | | | | |

MACPGGLPAR CSGWMGLGGP SGSSPASPPH SSSRYNGPNL VLKYDRAQKR LVNIAVDERS 60

SPYYALRDRQ GNAIGVTACD IDGDGREEIY FLNTNNAFSG HSSSAQVPSG LHRNRPVLKP 120

PPTTPAGLLG LPPLSGRDFS SSLGQASPDS RQGERVPVPC CRGGLRPTHE PEPFLLRPKS 180

GVATYTDKLF KFRNNRWEDI LSDEVNVARG VASLFAGRSV ACVDRKGSGR YSIYIANYAY 240

GNVGPDALIE MDPEASDLSR GILALRDVAA EAGVSKYTEG FSHTASPSIG EISGRTEERE 300

GGDPEEADEE HSGDGSTSQL CRLGWKDGQF KEEAAALVEE QREAGAAGVP RGRVRTALQT 360

SKSHLADKNL FGPPCYYSVC APSPAHPFPA RQAPQHYPVA PLVTQLMTHG RLAGKLARSV 420

PHPRAPGMDP KCKGRHAEPG LMAEALGAWP ALSTTVVPGG LRSWEESRQK GQAMSRCALR 480

ELGGPWSQAT QHLPARELYD LGEPPILQRT DGDPGRRRDS PKVTQECHLV ATMPALGGLE 540

GPGRVAKREI GRETGAVGRP LSHPLVPNFP SCLRPLEAGT VPGAALPGNP GNWVLDMAKA 600

LAWNQMEKEE GKIHGDHEPR FRLRKAREAE FPPGSSEEPL LQFPSGLRGS PVLQVGLGLA 660

SATHCGSMSF LGGRGVSVGP ILSSSASDIF CDNENGPNFL FHNRGDGTFV DAAASAERRL 720

AFIVHLKYHL CRDFPHSLCH LAETGPSSSC CPWHARLLQA PHCHHGLSMS FTRTGSRFYS 780

FLTQGLASSA HRRTLSLQGS QGAPPCLLAR APCVLGSLIP TAYYIVLWSA IPESLMTHSY 840

LSSERVNVGV DDPHQHGRGV ALADFNRDGK VDIVYGNWNG PHRLYLQMST HGKVRFRDIA 900

SPKFSMPSPV RTVITADFDN DQELEIFFNN IAYRSSSANR LFRCSILARG SSSLTAGGRN 960

GQGEGLRIRR GGFPGPGGQA KVNTGPLMKK QKGRKDEDWA RGCGNAGQSL AKEPASAIAG 1020

KGKGNVAQSV PRTQAPQDTK PHYHKKGLQG PITTRKRGYG VQSLPGKGAT GSNHYQEKGL 1080

RGPITTRKRG YGVQSLPGKG ATGSNHYQEK GLQGPITTRK RGYGLQSLPG KGATGSNHYH 1140

RKGLRAPITT RKRGYGVQSL PGKGATGSNH YQEKGLRGPI TTRKRGYGLQ SLPGKGATGS 1200

NHYQEKGLQG PITTRKRGYR VQSLPGKGAT GSNHYQEKGL RGPITTRKRG YGLQSLPGKE 1260

AMGSNHYQEK GLRAPITTRK RGYGVQSLPQ KGATGSNVIR REHGDPLIEE LNPGDALEPE 1320

GRGTGGVVTD FDGDGMLDLI LSHGESMAQP LSVFRGNQGF NNNWLRVVPR TRFGAFARGA 1380

KVVLYTKKSG AHLRIIDGGS GYLCEMEPVA HFGLGKDEAS SVEVTWPDGK MVSRNVASGE 1440

MNSVLEILYP RDEDTLQDPA PLECGQGFSQ QENGHCMDTN ECIQFPFVCP RDKPVCVNTY 1500

GSYRCRTNKK CSRGYEPNED GTACVGTELG SRHTMTWKPR PKKELQLSQG ICTPVWSFFL 1560

PGCRLLLKRA QLQAAPSTLL QKAPGIPEAQ VYEQDQE



Seq ID NO: 61 DNA sequence 

Nucleic Acid Accession #: NM_000584.1

Coding sequence: 75..374

1 11 21 31 41 51

AGCAGAGCAC ACAAGCTTCT AGGACAAGAG CCAGGAAGAA ACCACCGGAA GGAACCATCT 60

CACTGTGTGT AAACATGACT TCCAAGCTGG CCGTGGCTCT CTTGGCAGCC TTCCTGATTT 120

CTGCAGCTCT GTGTGAAGGT GCAGTTTTGC CAAGGAGTGC TAAAGAACTT AGATGTCAGT 180 

GCATAAAGAC ATACTCCAAA CCTTTCCACC CCAAATTTAT CAAAGAACTG AGAGTGATTG 240 

AGAGTGGACC ACACTGCGCC AACACAGAAA TTATTGTAAA GCTTTCTGAT GGAAGAGAGC 300 

TCTGTCTGGA CCCCAAGGAA AACTGGGTGC AGAGGGTTGT GGAGAAGTTT TTGAAGAGGG 360 

CTGAGAATTC ATAAAAAAAT TCATTCTCTG TGGTATCCAA GAATCAGTGA AGATGCCAGT 420 

GAAACTTCAA GCAAATCTAC TTCAACACTT CATGTATTGT GTGGGTCTGT TGTAGGGTTG 480 

CCAGATGCAA TACAAGATTC CTGGTTAAAT TTGAATTTCA GTAAACAATG AATAGTTTTT 540 

CATTGTACCA TGAAATATCC AGAACATACT TATATGTAAA GTATTATTTA TTTGAATCTA 600 

CAAAAAACAA CAAATAATTT TTAAATATAA GGATTTTCCT AGATATTGCA CGGGAGAATA 660 

TACAAATAGC AAAATTGAGC CAAGGGCCAA GAGAATATCC GAACTTTAAT TTCAGGAATT 720 

GAATGGGTTT GCTAGAATGT GATATTTGAA GCATCACATA AAAATGATGG GACAATAAAT 780 

TTTGCCATAA AGTCAAATTT AGCTGGAAAT CCTGGATTTT TTTCTGTTAA ATCTGGCAAC 840 

CCTAGTCTGC TAGCCAGGAT CCACAAGTCC TTGTTCCACT GTGCCTTGGT TTCTCCTTTA 900 

TTTCTAAGTG GAAAAAGTAT TAGCCACCAT CTTACCTCAC AGTGATGTTG TGAGGACATG 960 

TGGAAGCACT TTAAGTTTTT TCATCATAAC ATAAATTATT TTCAAGTGTA ACTTATTAAC 1020 

CTATTTATTA TTTATGTATT TATTTAAGCA TCAAATATTT GTGCAAGAAT TTGGAAAAAT 1080 

AGAAGATGAA TCATTGATTG AATAGTTATA AAGATGTTAT AGTAAATTTA TTTTATTTTA 1140 

GATATTAAAT GATGTTTTAT TAGATAAATT TCAATCAGGG TTTTTAGATT AAACAAAGAA 1200 

ACAATTGGGT ACCCAGTTAA ATTTTCATTT CAGATAAACA ACAAATAATT TTTTAGTATA 1260 

AGTACATTAT TGTTTATCTG AAAGTTTTAA TTGAACTAAC AATCCTAGTT TGATACTCCC 1320 

AGTCTTGTCA TTGCCAGCTG TGTTGGTAGT GCTGTGTTGA ATTACGGAAT AATGAGTTAG 1380

AACTATTAAA ACAGCCAAAA CTCCACAGTC AATATTAGTA ATTTCTTGCT GGTTGAAACT 1440 

TGTTTATTAT GTACAAATAG ATTCTTATAA TATTATTTAA ATGACTGCAT TTTTAAATAC 1500 

AAGGCTTTAT ATTTTTAACT TTAAGATGTT TTTATGTGCT CTCCAAATTT TTTTTACTGT 1560 

TTCTGATTGT ATGGAAATAT AAAAGTAAAT ATGAAACATT TAAAATATAA TTTGTTGTCA 1620 
AAGTAAAAAA AAAAAAAAA 

Seq ID NO: 62 Protein Sequence 
Protein Accession #: NP_000575.1

1 11 21 31 41 51

| | | | | |

MTSKLAVALL AAFLISAALC EGAVLPRSAK ELRCQCIKTY SKPFHPKFIK ELRVIESGPH 60
CANTEIIVKL SDGRELCLDP KENWVQRVVE KFLKRAENS

Seq ID NO: 63 DNA sequence 

Nucleic Acid Accession #: EOS sequence 

Coding sequence: 52-1146 

1 11 21 31 41 51 

I I I I I I 

GTTACCCAGC ATTGTGAGTG ACAGAGCCTG GATCTGAACG CTGATCCCAT AATGCATCCT 60 

CAAGTGGTCA TCTTAAGCCT CATCCTACAT CTGGCAGATT CTGTAGCTGG TTCTGTAAAG 120 

GTTGGTGGAG AGGCAGGTCC ATCTGTCACA CTACCCTGCC ACTACAGTGG AGCTGTCACA 180 

TCCATGTGCT GGAATAGAGG CTCATGTTCT CTATTCACAT GCCAAAATGG CATTGTCTGG 240 

ACCAATGGAA CCCACGTCAC CTATCGGAAG GACACACGCT ATAAGCTATT GGGGGACCTT 300 

TCAAGAAGGG ATGTCTCTTT GACCATAGAA AATACAGCTG TGTCTGACAG TGGCGTATAT 360 

TGTTGCCGTG TTGAGCACCG TGGGTGGTTC AATGACATGA AAATCACCGT ATCATTGGAG 420 

ATTGTGCCAC CCAAGGTCAC GACTACTCCA ATTGTCACAA CTGTTCCAAC CGTCACGACT 480 

GTTCGAACGA GCACCACTGT TCCAACGACA ACGACTGTTC CAATGACGAC TGTTCCAACG 540 

ACAACTGTTC CAACAACAAT GAGCATTCCA ACGACAACGA CTGTTCTGAC GACAATGACT 600 

GTTTCAACGA CAACGAGCGT TCCAACGACA ACGAGCATTC CAACAACAAC AAGTGTTCCA 660

GTGACAACAA CTGTCTCTAC CTTTGTTCCT CCAATGCCTT TGCCCAGGCA GAACCATGAA 720 

CCAGTAGCCA CTTCACCATC TTCACCTCAG CCAGCAGAAA CCCACCCTAC GACACTGCAG 780 

GGAGCAATAA GGAGAGAACC CACCAGCTCA CCATTGTACT CTTACACAAC AGATGGGAAT 840 

GACACCGTGA CAGAGTCTTC AGATGGCCTT TGGAATAACA ATCAAACTCA ACTGTTCCTA 900 

GAACATAGTC TACTGACGGC CAATACCACT AAAGGAATCT ATGCTGGAGT CTGTATTTCT 960 

GTCTTGGTGC TTCTTGCTCT TTTGGGTGTC ATCATTGCCA AAAAGTATTT CTTCAAAAAG 1020 

GAGGTTCAAC AACTAAGTGT TTCATTTAGC AGCCTTCAAA TTAAAGCTTT GCAAAATGCA 1080 

GTTGAAAAGG AAGTCCAAGC AGAAGACAAT ATCTACATTG AGAATAGTCT TTATGCCACG 1140 

GACTAAGACC CAGTGGTGCT CTTTGAGAGT TTACGCCCAT GAGTGCAGAA GACTGAACAG 1200 

ACATCAGCAC ATCAGACGTC TTTTAGACCC CAAGACAATT TTTCTGTTTC AGTTTCATCT 1260 

GGCATTCCAA CATGTCAGTG ATACTGGGTA GAGTAACTCT CTCACTCCAA ACTGTGTATA 1320 

GTCAACCTCA TCATTAATGT AGTCCTAATT TTTTATGCTA AAACTGGCTC AATCCTTCTG 1380 

ATCATTGCAG TTTTCTCTCA AATATGAACA CTTTATAATT GTATGTTCTT TTTAGACCCC 1440 

ATAAATCCTG TATACATCAA AGAGAATAGT CCCTGGAAAC ATAGCAAATG AACTTCTATC 1500 

TTGGCCATCA CAGCTGTCCA GAAGAGGGGA ATCTGTCTTA AAAACCAGCA AATCCAACGT 1560 

GAGACTTCAT TTGGAAGCAT TGTATGATTA TCTCTTGTTT CTATGTTATA CTTCCAAATG 1620 

TTGCATTTCC TATGTTTTCC AAAGGTTTCA AATCGTGGGT TTTTATTTCC TCCGTGGGGA 1680 

AACAAAGTGA GTCTAACTCA CAGGTTTAGC TGTTTTCTCA TAACTCTGGA AATGTGATGC 1740 

ATTAAGTACT GGATCTCTGA ATTGGGGTAG CTGTTTTACC AGTTAAAGAG CCTACAATAG 1800 

TATGGAACAC ATAGACACCA GGGGAAGAAA ATCATTTGCC AGGTGATTTA ACATATTTAT 1860 

GCAATTTTTT TTTTTTTTTT TGAGATGGAG CTTTGCTCTT GTTGCCCAGG CTGGAGTGCG 1920 

ATGGTGAAAT CTCGGCTCAC TGTAACCTCC ACCTTCCGGG TTCAAGCAAT TCTCCCGCCT 1980 

CAGCCTCACG AGTAGCTGGG ATTACAGGTG TCCGCCACTA TGCTCAGCTA ATTTTTTGTA 2040

TTTTTTTAGT AGAGACAGGG TTTCATCATG TTGGCCAGAA TGATCTCGAA CTCCTGACCT 2100 

CAGAGGATCC AGCCTCCTTG GCCTTCCAAA GTGCTGGGAT TACAGGCATG AGCCACCGTG 2160

CCTGGTCCAT TTATGCAATT TTTATTACAC GTGTGTCATG GTGCTAAGCA TTTAAACAAC 2220

CATCAGGCTT TCTGAACAAA TTTTTAGAAC CAAGAAGATC AGCAATAAAA GAAAAAAAAA 2280

AGGCGCGAGT AGGGAAAAGT CTGCCTTTTC TGCCTTATAA GTTACATTGC AGATATAAGT 2340

GTTTGAAACT CACTAGTTTT GACCCAAATT CTAAATCGCC TCCAAACCAC TTTCATGTTT 2400

AAAATTGTTT AGGAGGCTAG CGCCGTGGCT CATGCCTGTA ATCCCAGCAC TTTGTGAAGC 2460

CAAGGTGGGC AGGTAGCTTG AGCCCAGAAG TTTGAGACCA GCCTGGGCAA CATGGCAATA 2520

CCCCATCTCT ATTAAGAAAA AAATTTAAGG AAAAAAAAAA AAGATAAAAT TGTTTTGGAA 2580

AGGCAAAGTG TCTTTCTCAT ATCTAATCAA TTCCTGTATT TTTTTTTTTT TTTTTTTTAC 2640

AATTCAATTA GGCTTATTGT AAGGATATCT TTAGTCAGTA TAAAAATATA TAAAAAACAA 2700

TTATACTAAA CCAGTATAAT TGTTTAGAAG TATATATTTG CCGGCCGGGC ATGGTGGCTC 2760

ATGCCTGTAA TCCCAGGGCT TTGGAAGGCT GAGGTGAGTG GATCACTAGA GGTCGGGAGT 2820

TTGAGACCAG CCTGACCAAC ATGGAGAAAC ACTGTTCTCT ACTAAGTATA GATTTGCAAG 2880

GGCTAGATTT GTCAGAAAAG AAACATTTCT ACATGAACAG TTAAGGCTGG GTATTAATTT 2940

CAATTTCAGT TTGAGTAAAC TCTGAAATAA TTCATGGTTG AGGTTCAGAG CTACCCATAC 3000

CATAGTTGTC TGGGGCAACA GGATGCTTGC TTTCAGAGGA AACTTTTTGT TTTTGTTTTT 3060

GTTTTTTTGA GATGGAGTTT TGCTCTTGTC ACCCAGGCTG AAATGCAATG GCACAATCTT 3120

GGCTCACTGC AACCTCCACC TCCCAGGTTC AAGTGATTCT CCTGCCTCAG CCTCCTGAGT 3180

AGCTGGGATT ACAGATGCCC GCCACAACGC CCAGCTTATT TTTAGTAGAG ACAGGGTTTC 3240

ACCATATTGG CCAGGCTGGC CTCCAACTCC TGACCTCAGG TGAGCCACCC GCTTTGGCCT 3300

CCCCAAAGTG TTGGGATTAC AGGCATAAGC CACCGCGCCC AGCCAGAGGC AACATTTTTT 3360

AACGCAGTTA TCATTCTAGG AAATTTATAG GTCCTTTGAA GGAAAATTCT GTGGGCAAAT 3420

AAGATTGTGA TACATGGTAT TTCAGTTTTC CCAAATGTGG CCAGCCCGAT CTGGTCAAAA 3480

ATTTTATTTT TTAAAAGCTA TAGTGTCTTT TTTTCTTAAA TTTGAGGCAA CATGCACAAA 3540

ATTGGAGATT TGAAATTAAA GCCAAGATTT GTAGTTTCTC TGGAAAGACC TGGCAAGATT 3600

GGACTGGATT GCTATGTGAC CAGGGTCCCA CTAGATGGGG CTGCATCCTC TAATCCCCAA 3660

ATCCTTATGT TCCCTGCATG CTCACCTTTG TTACCTGCCT GACACCTGTG GGGCTTTTAA 3720

CTTTATGGCA ACTGCCCTAT TCTCTGGATC CTTCCTGAGG ATTTATGATG CGTAATACTC 3780

CAGGAATCTG GTTAGCTTTG CTTAACACAT TTCCAAAACT TGTTTGAATG CATGAGTACA 3840

GTCACTAGTA GCATTCTGTG CAGTACAATG TATGGGGGCT TAGGAGTTTA GGGTAGTATA 3900

CAGGATTAGG GATAGGACTT GAGTCTAATC CTAACTCTTA GCAGTTACAC TGGATGACAT 3960

TAGAGCAAAT GGTTCTTTAC GTCTACATTT TCTTCATCTG TAGATGTAAT AATTTCCATA 4020

TCAACTATGA TGTACAGTGC TAATTCCAAT GAAATGTTAC ATGTGAGAAG TCTTTGAAAT 4080

GTAAAAAACA CTACAGATAC TGAAGCAGTT TGGAGAATTA AAAAACACTA CGAAAACACA 4140

GCTTGGTAAA TAATTTTTTT TTTCTTTTGA GACAGAGTCT GGCTCTGTCA TCCAGGCTAG 4200

AGTGTAGTG



Seq ID NO: 64 Protein sequence: 
Protein Accession #: NP_036338.1

1 11 21 31 41 51

| | | | | |

MHPQVVILSL ILHLADSVAG SVKVGGEAGP SVTLPCHYSG AVTSMCWNRG SCSLFTCQNG 60

IVWTNGTHVT YRKDTRYKLL GDLSRRDVSL TIENTAVSDS GVYCCRVEHR GWFNDMKITV 120

SLEIVPPKVT TTPIVTTVPT VTTVRTSTTV PTTTTVPTTT VPTTMSIPTT TTVPTTMTVS 180

TTTSVPTTTS IPTTTSVPVT TTVSTFVPPM PLPRQNHEPV ATSPSSPQPA ETHPTTLQGA 240

IRREPTSSPL YSYTTDGNDT VTESSDGLWN NNQTQLFLEH SLLTANTTKG IYAGVCISVL 300

VLLALLGVII AKKYFFKKEV QQLSVSFSSL QIKALQNAVE KEVQAEDNIY IENSLYATD



Seq ID NO: 65 DNA sequence 

Nucleic Acid Accession #: EOS sequence

Coding sequence: 2932-4764



1 11 21 31 41 51

| | | | | |

AGAAGAAAGC TGGTAGGGGC TGGGAGAGGG TACCACAGGG GAGTATGATC TACTTGGGGG 60

CCAGAGAAGG TTCCCTGAGG AAATAGTACC TGAACTTAGA CTTGAAGGAT AACAGATGTT 120

AACTGGGAGG AGAGAATGTT CCAGGCAGAG GAAAAGGCAT ATGCAAAAGT CCAGCGCCTT 180

GAAGGAGCAC AGCTGGGGTG CCTGGAGTGA GATGGAGCTG GAAAGATCCA GGTGGAGCTT 240

GACCTGAAGT ACCAGCCCCC AGAGGGCGCT ACTGGAGCCT GGTCAGAGGA GGACTTTGGG 300

GCACCCATCC AGGACAGCTT CGAGTTAATC ATCCCCAATG TGGGCTTCCA GGAACTGGAG 360

CCTGGGGAGG CCCAGCTGGA GCGGCGGGCA GTGGCTCTAG GCCGCAGGCT AGCTCGAAGT 420

CTAGGCCAGC AGGACGATGA AGAGAATGAG CTGGAGCTTG AGCTGGAGCA GGACCTGGAT 480

GATGAGCCTG ACGTGGAACT TTCTGGTGTT ATGTTCAGCC CCCTCAAGAG CCGCGCCAGG 540

GCCCTGGCCC ATGGGGATCC CTTCCAGGTG TCCAGAGCTC AAGACTTCCA GGTGGGAGTC 600

ACTGTGCTGG AAGCCCAGAA ACTGGTGGGA GTCAACATTA ACCCCTATGT GGCCGTGCAA 660

GTGGGGGGGC AGCGCCGTGT GACCGCCACA CAGCGTGGGA CCAGTTGCCC CTTCTACAAT 720

GAGTACTTCT TGTTCGAATT TCATGACACG CGGCTTCGTC TCCAAGACTT GCTGCTGGAG 780

ATCACGGTGA GTGGGGTAGG GGTGACCAGT GTCCTTCAGA GAAGGGGGGA TGAGAAAGCT 840

GCAGGACTAA CACCACCTTC CCCCAAGGCT TTCCATTCGC AGACCCTCCC CTTTATGGCC 900

ACCCGGATAG GCACCTTCAG GATGGACCTG GGCATCATCT TGGACCAGCC AGGTATGGAA 960

TCGTCCCCTT ATTGAGACTC TGCACGGACA AGGGCCCTAG AGATTGACCC TGCAGTGACT 1020

CCGCATGGAC CCCTATACAC TCACTTCGGA GAGGGCCATC TCTGGCGGAG GCTGAACTCT 1080

TGGCACTTCC GCCCCTCCCT GCTGAGCCAG AGAAGCCCTG GCCATTGTCC GTCACTCCGA 1140

TAGCCTCACG GCCACCCTGT GCGTCCCGCC GGTCGCCCCT TACCCCTGGC TCGCCCCTTC 1200

GCCCTTAGAT GGCCAGTTCT ACCAAAGATG GGTTCCGCTG CATGATCCCC GAGACACCCG 1260

CGCCGGGACC AAGGGTTTCA TTAAGGTCAC CTTGTCCGTG AGGGCGCGCG GGGACCTGCC 1320

CCCTCCAATG CTACCCCCGG CCCCAGGGCA CTGTTCGGAC ATCGAGAAGT GAGCCGGGGT 1380

GAGGTGGGGA GGAGGACATG GATCCGGGGG TGGCCGTGGG GCGCGGATAA GGGGAGGGGC 1440

CGAGATCCCA GTTTCTCCCC CCCCGCTCGG TGCCCCCTCC CCTAGGAACC TGCTCCTGCC 1500

GCGCGGGGTG CCCGCCGAGA GGCCATGGGC GCGGCTCCGC GTGCGCCTGT ACCGCGCCGA 1560 

GGGGCTTCCC GCGCTGCGCC TGGGGCTGCT GGGCAGCCTG GTCCGCGCCC TGCACGACCA 1620 

GCGCGTCCTG GTGGAGCCCT ATGTGCGGGT GTCTTTCCTG GGGCAGGAGG GCGAGACGTC 1680 

GGTGAGCGCC GAGGCGGCGG CGCCCGAATG GAACGAGCAG CTGAGCTTCG TGGAGCTCTT 1740 

CCCGCCGCTG ACGCGCAGCC TCCGCCTGCA GCTGCGGGAC GACGCGCCCC TGGTCGACGC 1800 

GGCACTCGCT ACGCACGTGC CGGACCTGAG GCGGATCTCC CATCCGGGCC GCGCGGCGGG 1860 

GTTTAACCCT ACCTTCGGCC CGGCCTGGGT GCCCCTCTAT GGCTCGCCCC CCGGCGCGGG 1920 

GCTCCGGGAT AGTCTTCAAG GTCTCAACGA AGGCGTTGGC CAAGGCATTT GGTTCCGCGG 1980 

CCGCCTTCTG CTGGCTGTGT CCATGCAGGT GTTGGAAGGG AGAGCTGAAC CTGAGCCTCC 2040 

CCAGGCCCAG CAGGGGTCCA CGTTGTCCCG GCTCACCCGA AAGAAGAAAA AGAAAGCCAG 2100 

AAGGGATCAG ACCCCAAAGG CGGTTCCGCA GCACTTGGAC GCCAGCCCCG GTGCCGAGGG 2160 

GCCTGAGATC CCCCGTGCCA TGGAGGTGGA GGTGGAGGAG CTGCTGCCGC TGCCAGAGAA 2220 

TGTCCTGGCG CCCTGTGAAG ATTTCCTGCT TTTCGGTGTG CTCTTCGAGG CCACCATGAT 2280 

CGACCCCACC GTGGCCTCCC AGCCCATCAG CTTCGAGATC TCCATTGGTG TGTGGCCTAG 2340 

CCGAACCCCT GAGTGCCATT TCAGACCTTA GAACCCTGGA AGGGGTGTTG ACTTTCAGTC 2400 

GCGCAGGCCG TCTGGAGGAG CAATTGGGCC GAGGGTCCAG GGCTGGGGAG GGAACTGAGG 2460 

GTGCAGCCGT GGAGGCTCAG CCTCTGCTGG GAGCCAGGCC AGAGGAGGAG AAAGAGGAGG 2520 

AAGAACTGGG GACCCATGCT CAGCGGCCTG AGCCCATGGA CGGCAGTGGG CCATACTTCT 2580 

GCTTGCCCCT CTGTCACTGC AAGCCATGCA TGCATGTGTG GAGTTGCTGG GAGGACCACA 2640 

CCTGGCGCCT GCAGAGCAGC AACTGCGTGC GCAAAGTGGC CGAGAGGCTG GACCAGGGGC 2700 

TGCAGGAGGT TGAGAGACTG CAGCGCAAGC CGGGGCCTGG CGCCTGTGCA CAGCTCAAGC 2760 

AGGCACTGGA AGTACTGGTG GCTGGGAGCA GACAGTTTTG CCACGGTGCC GAGCGCAGGA 2820 

CGATGACCCG GCCCAATGCC CTGGATCGAT GCCGAGGGAA ACTCCTGGTG CACAGCCTGA 2880 

ACCTTTTGGC TAAGCAAGGA CTGCGACTTC TACGCGGCCT GAGACGGCGC AATGTGCAAA 2940 

AGAAGGTGGC ACTGGCCAAG AAGCTCCTGG CAAAACTGCG CTTTCTGGCT GAGGAGGCAC 3000 

CCGGGGCAGC CCCTGGTGAG GTCTGTGCCA AGCTGGAGCT CTTCCTGCGG CTGGGCCTGG 3060 

GCAAGCAAGC CAAGGCCTGC ACCTCTGAGC TGCCCCCGGA TTTGCTGCCC GAGCCCTCAG 3120 

CCGGGCTGCC CTCCAGCCTA CACCGGGACG GTCCTGGAGC AGACGCTGAG CCCTCTGTGG 3180 

GATGAACTCC TGGTATTTGA GCAGTTGATC GTGGATGGGA GGAGGGAGCA CCTGCAGGAG 3240 

GAGCCTCCAT TAGTGATCAT CAATGTATTT GACCACAATA AGTTTGGCCC CCCCGTGTTC 3300 

CTGGGCAGGG CACTGGCCGC CCCAAGGGTA AAGCTGATGG AGGACCCATA CCAACGCCCA 3360 

GAGTTGCAGT TCTTCCCCCT GAGGAAGGGA CCCTGGGCAG CCGGAGAGCT CATTGCCGCC 3420 

TTTCAACTCA TTGAACTAGA CTACAGTGGC CGACTTGAGC CCTCAGTGCC CAGTGAGGTG 3480 

GAGCCCCAGG ATCTGGCACC CCTGGTTGAG CCCCACTCTG GACGCCTGTC CCTTCCACCC 3540 

AACGTGTGCC CAGTGCTCAG GGAGTTCCGT GTTGAGGTGC TGTTCTGGGG TCTTAGGGGA 3600 

CTTGGTCGTG TGCATCTGCT CGAGGTGGAG CAGCCCCAGG TTGTACTGGA GGTGGCTGGG 3660 

CAAGGTGTGG AGTCTGAGGT CCTGGCCAGC TACCGTGAGA GCCCCAATTT CACTGAGCTT 3720 

GTCAGGCATC TGACAGTGGT CTTCAAAGAC ACAGCTCCTC TCTTCCACCC CCAGGACTTG 3780 

CCGGAGCAGC CTTACTTGCA GCCTCCACTC AGCATCTTGG TGATTGAGCG CCGGGCCTTT 3840 

GGCCACACAG TCCTTGTGGG TTCCCACATT GTCCCCCACA TGCTGCGATT CACATTTCGG 3900 

GGTCATGAGG ATCCTCCTGA GGAGGAAGGA GAGATGGAGG AGACAGGGGA TATGATGCCC 3960 

AAGGGACCTC AAGGACAGAA GTCCCTGGAT CCCTTCTTGG CTGAAGCGGG TATATCCAGA 4020 

CAGCTCCTGA AGCACAACTT TGATGAAGAT GAAATGGATG ATCCTGGAGA TTCAGATGGG 4080 

GTCAACCTCA TTTCTATGGT TGGGGAGATC CAAGACCAGG GTGAGGCTGA AGTCAAAGGC 4140 

ACTGTGTCCC CAAAAAAAGC AGTTGCCACC CTGAAGATCT ACAACAGGTC CCTGAAGGAA 4200 

GAATTTAACC ACTTTGAAGA CTGGCTGAAT GTGTTTCCTC TGTACCGAGG GCAAGGGGGC 4260 

CAGGATGGAG GTGGAGAAGA GGAAGGATCT GGACACCTTG TGGGCAAGTT CAAGGGCTCC 4320 

TTCCTCATTT ACCCTGAATC AGAGGCAGTG TTGTTCTCTG AGCCCCAGAT CTCCCGGGGG 4380 

ATCCCACAGA ACCGGCCCAT CAAGCTCCTG GTCAGAGTGT ATGTTGTAAA GGCTACCAAC 4440 

CTGGCTCCTG CAGACCCCAA TGGCAAAGCA GACCCTTACG TGGTGGTGAG CGCTGGCCGG 4500 

GAGCGGCAGG ACACCAAGGA ACGCTACATC CCCAAGCAGC TCAACCCCAT CTTTGGAGAG 4560 

ATCCTGGAGC TAAGCATCTC TCTCCCAGCT GAGACGGAGC TGACGGTCGC CGTATTTGAT 4620 

CATGACCTCG TGGGTTCTGA CGACCTCATC GGGGAGACCC ACATTGATCT GGAAAACCGA 4680 

TTCTATAGCC ACCACAGAGC AAACTGTGGG CTGGCCTCCC AGTATGAAGT GTGGGTCCAG 4740 

CAGGGCCCAC AGGAGCCATT CTGAGTTTCT GGCCAAACAC ATTCAAGCTC ACATTCCCTT 4800 

TTGTGTCTCC AGATCCTATG ATTTCATGGA AGGGGACCCT CCCACCCACC GCCACTGCCA 4860 

ACCAAGACAT AGCTCAGTGG TCAAGACTTG GGCTTGGGAG TCGGGATCCT GTAACGAATG 4920 

TCACTTGACC GCTTTCTTTT TTTATGAAAC AGTCTCGCTC TGTCTCCCAG GTTGGAGTGC 4980 

AGTGGCACGA TCTCGGCTGA CTGCAACCTC CACCTCCTGG GTTCAAGCGA TTCTCCTGCC 5040 

TCAGCCTCCC CAGTAGCTGG GATTACAGGC GTGGGCCCCC ATGTCCAGCT AATTTTTATA 5100 

TTTTCGCTCT GTCTCCCAGG TTGGAGTGCA GTGGCACGAT CTCGGCTGAC TGCAACCTCC 5160 

ACCTCCTGGG TTCAAGCGAT TCTCCTGCCT CAGCCTCCCC AGTAGCTGGG ATTACAGGCG 5220 

TGGGCCCCCA TGTCCAGCTA ATTTTTATAT TTTTAGTAGA GACAGGGTTT CACCATGTTG 5280 

TCCAGGCTGG TCTTGAACCC CTGACCTCAA GTGATCCACC CACCTCTGCC TCCCAAAGTG 5340 

CTGGGATTAC AGGTGTGAGC CACCATGCCA GGCCCTCTTA ACCTCTTCAA GTCTGTTTTC 5400 

TCATCTGCAA AACAGAGGTA ATAAGATCAG TATCTTCTTA ATGGAAGCAC CTGGACTACA 5460 

TTTTTTTCAT TCATTGTTAT CATAAATGAG GACTAACCTG TCTCCCGTTG GGAGTTTTGA 5520 

ACCTAGACCT CATGTCTTCA TGACGTCATC ACTGCCCCAG GCCCAGCTGT GTCCCTACAC 5580 

CAGCCCCAGC TGACGCATCT TCTTTTTCTG CCTGTAGAGA TGGTTACAAT GCCTGGCGTG 5640 

ATGCATTCTG GCCTTCGCAG ATCCTGGCGG GGCTGTGCCA ACGCTGTGGC CTCCCTGCCC 5700 

CTGAATACCG AGCCGGTGCT GTCAAGGTGG GCAGCAAAGT CTTCCTGACA CCACCGGAGA 5760 

CCCTGCCCCC AGGGATCTCT TCACATGTGG ATTGACATCT TTCCTCAAGA TGTGCCTGCT 5820

CCACCCCCAG TTGACATCAA GCCTCGGCAG CCAATCAGCT ATGAGCTCAG AGTTGTCATC 5880 

TGGAACACGG AGGATGTGGT TCTGGATGAC GAGAATCCAC TCACCGGAGA GATGTCGAGT 5940 

GACATCTATG TGAAGAGCTG GGTGAAGGGG TTGGAGCATG ACAAGCAGGA GACAGACGTT 6000 

CACTTCAACT CCCTGACTGG GGAGGGGAAC TTCAATTGGC GCTTTGTGTT CCGCTTTGAC 6060 

TACCTGCCCA CGGAGCGGGA GGTGAGCGTC TGGCGCAGGT CTGGACCCTT TGCCCTGGAG 6120

GAGGCGGAGT TCCGGCAGCC TGCAGTGCTG GTCCTGCAGG ATCCCTGGAG TTGCAGCTAC 6180

CAGACATGGT GCGTGGGGCC CGGGGCCCCG AGCTCTGCTC TGTGCAGCTG GCCCGCAATG 6240 

GGGCCGGGCC GAGGTGCAAT CTGTTTCGCT GCCGCCGCCT GAGGGGCTGG TGGCCGGTAG 6300 

TGAAGCTGAA GGAGGCAGAG GACGTGGAGC GGGAGGCGCA GGAGGCTCAG GCTGGCAAGA 6360 

AGAAGCGAAA GCAGAGGAGG AGGAAGGGCC GGCCAGAAGA CCTGGAGTTC ACAGACATGG 6420 

GTGGCAATGT GTACATCCTC ACGGGCAAGG TGGAGGCAGA GTTTGAGCTG CTGACTGTGG 6480 

AGGAGGCCGA GAAACGGCCA GTGGGGAAGG GGCGGAAGCA GCCAGAGCCT CTGGAGAAAC 6540 

CCAGCCGCCC CAAAACTTCC TTCAACTGGT TTGTGAACCC GCTGAAGACC TTTGTCTTCT 6600 

TCATCTGGCG CCGGTACTGG CGCACCCTGG TGCTGCTGCT ACTGGTGCTG CTCACCGTCT 6660 

TCCTCCTCCT GGTCTTCTAC ACCATCCCTG GCCAGATCAG CCAGGTCATC TTCCGTCCCC 6720 

TCCACAAGTG ACTCTCGCTG ACCTTGGACA CTCACCCAGG GTGCCAACCC TTCAATGCCT 6780 

GCTCCTGGAA GTCTTTCTTA CCCATGTGAG CTACCCCAGA GTCTAGTGCT TCCTCTGAAT 6840 
AAACCTATCA CAGCC 



Seq ID NO: 66 Protein sequence
Protein Accession #: EOS sequence

1 11 21 31 41 51

| | | | | |

MCKRRWHWPR SSWQNCAFWL RRHPGQPLVR SVPSWSSSCG WAWASKPRPA PLSCPRICCP 60 

SPQPGCPPAY TGTVLEQTLS PLWDELLVFE QLIVDGRREH LQEEPPLVII NVFDHNKFGP 120

PVFLGRALAA PRVKLMEDPY QRPELQFFPL RKGPWAAGEL IAAFQLIELD YSGRLEPSVP 180

SEVEPQDLAP LVEPHSGRLS LPPNVCPVLR EFRVEVLFWG LRGLGRVHLL EVEQPQVVLE 240

VAGQGVESEV LASYRESPNF TELVRHLTVV FKDTAPLFHP QDLPEQPYLQ PPLSILVIER 300

RAFGHTVLVG SHIVPHMLRF TFRGHEDPPE EEGEMEETGD MMPKGPQGQK SLDPFLAEAG 360

ISRQLLKHNF DEDEMDDPGD SDGVNLISMV GEIQDQGEAE VKGTVSPKKA VATLKIYNRS 420

LKEEFNHFED WLNVFPLYRG QGGQDGGGEE EGSGHLVGKF KGSFLIYPES EAVLFSEPQI 480

SRGIPQNRPI KLLVRVYVVK ATNLAPADPN GKADPYVVVS AGRERQDTKE RYIPKQLNPI 540

FGEILELSIS LPAETELTVA VFDHDLVGSD DLIGETHIDL ENRFYSHHRA NCGLASQYEV 600 
WVQQGPQEPF 



Seq ID NO: 67 DNA sequence 

Nucleic Acid Accession #: NM_002449.2 

Coding sequence: 223..1026



[ 



GGGGGGGGGG 
GAGGCCCGGC 
TCCCAGCGCG 
CAGCGGAGTC 
GGCAATGACT 
GGGCTGGGGG 
TTCAGCGTGG 
CCCGAAGGCG 
CGGGAAGCGC 
GGAAATTCAG 
AGACATATGA 
ACGCCCTTTA 
CTCTCCATTG 
AAAATCTGGT 
AAGCTGAAAA 
AGCTCGCCCC 
CTTCCCATCC 
TCCTAAGGAA 
TCCCTCTCCA 
ACCCCACCCC 
GTAGGCGGAG 
CCCTTGCACA 
AAAAATACTA 
AAATTTATAT 
AAGTCCATTT 
GTCTCCAGCC 
GTCAATGATT 
GCCCGCTGTG 
ACAATGCAGC 
CCACGGCGTC 
TATTTCTGCA 
GGTCAAGTCG 
TCCCGCCCAC 
GCCAAGACAT 
CCAATACCTG 
GCGCACGCCC 
ATACTTCCTA 
AGTACCTCTC 
GTTAAAACCT 
TCTGAGAAAT 
TAGTAATTTT 
TGAACAATTA 



11 
I 

GGCAGCCTCT 
GCGGATTGGC 
CCCCTCCCGT 
GCGCGTCGGG 
TGTTTTCGCC 
GCGCCGCGGG 
AGGCGCTCAT 
CCTCGGCCGG 
ACAGCCCCGG 
AAGATGGAGC 
GCCCTACCAC 
CCACATCCCA 
CAGAGCGTGC 
TCCAGAACCG 
TGGCTGCAAA 
TGCAGGCAGC 
CGCCCGTGGG 
GACCAGATCA 
CAAAGGCATA 
CTCTAACGGC 
GCACCAAGCC 
AAAGCTTGGC 
CAGACCCTTT 
ACATATGTGC 
GTTGTATTTT 
TGCCCTTCAG 
TTTACCGGGA 
CCTCCCGAAG 
AGCCAGAAAA 
CGGGAAGCGC 
TTTTACTTGC 
GGAAATTCAG 
GCCCACACAT 
ATGAGCCCTA 
GGATGAGAAG 
TTTACCACAT 
ATTCTTCTCA 
CATTGCAGAG 
GTGAGAGGAT 
ACTAGGTACA 
CTGCTTAAAA 
GTAGATCCAG 



21 



31 



CGGGAAGAGC 
GGCGCGCGTC 
CTCCGCAGCA 
AGCTACGTAG 
CGACGAGGAG 
GGCCGCGGAG 
GTCCGACAAG 
GGCCCACCTG 
GCCGCTGGTG 
GGCGTGGATG 
CTGCACCCTG 
GCTCCTCGCC 
AGAGTTCTCC 
AAGCGCCAAG 
ACCTATGCTA 
GTCCATATAC 
ACTCTATGCC 
ATAGACTCCA 
GCCAGCCAGT 
TAGCTGACAG 
CTGCTTTTCT 
TCTGATGGTT 
TTATCAGCAG 
TTTTTTTCTA 
CTTAAAGAGG 
CGTGGAGGCG 
TTATTGACTT 
GCGCCTCGGC 
TATTAGATAT 
ACAGCCCCGG 
ATTAAAAGAA 
AAGATGGAGC 
ATTTTTAAAG 
CCACCTGCAC 
AATCCGTAGA 
CCCAGCTCCT 
AGGCTGTTGG 
CGTGCAGAGT 
TGTACAGCTC 
TTCATCCTCA 
AATGAGTATC 
AAAGAAAAAA 



CAATCAGGGG 
TCCCACTTCC 
AAAAAGTTTG 
GGCAGAGAAG 
GGCCCAGCAG 
GAGCGCCGCG 
AAGCCGCCCA 
CGGCCACTGC 
AAGCCCTTCG 
CAGGAACCCG 
AGGAAACACA 
CTGGAGCGCA 
AGCTCTCTGA 
GCGAAAAGAC 
CCCTCCAGCT 
GCAGCATCCT 
ACGCCAGTGG 
TGATGGATGC 
ACTCCTGCGC 
GGCCACACCA 
TGGTGTAACT 
TTTTTGGCAT 
ACGTAAAAAT 
TATCTCACCT 
GAGACAAATT 
CTCATGTCCG 
CTGCTTATAC 
CGGCCTGCGG 
GGAGAGATTA 
GCCGCTGGTG 
ACCTCTTTAT 
GGCGTGGATG 
TTTTTAGGAA 
CCTGACCTTT 
CTGCCGGAAA 
CGCCCTGGAG 
TAACTTTGGA 
TCTCTATTTC 
TCTGAACCTC 
CAGATTGCAA 
TTGTAACCAT 
AAAATATGCT 



41 
1 

CGAGCGTCTT 
CCTCGGAGGA 
AGTCGCCGCT 
TCATGGCTTC 
TGGTGGCCGG 
TCAAGGTCTC 
AGGAGTCGCC 
TGCTGTCGGG 
AGACCGCCTC 
GCCGATATTC 
AGACCAATCG 
AGTTCCGTCA
ACCTCACAGA 
TGCAGGAGGC 
TCAGTCTCCC 
ACCCGTTCCA 
GATATGGCAT 
TTGTTTCAAA 
TGCTAAGCCC 
CATAGCTGAA 
TCCAGAGTCC 
GATGTATATA 
TCAAATTATT 
TCCCAAAAAG 
CGGAGGAGCG 
ATTTGCAAAA 
AAGAAGCCGC 
AAAAACAAAA 
TGGCCACTGC 
ATCAAAGTGA 
AAGCCCTTCG 
CTACATACGT 
CCCGGCCGAT 
TTAAGAATAT 
CACAAGACCA 
GTGAGGTAGA 
GCGCAAGTTC 
AGATAATTGG 
ACAGACCCAG 
AGGTGCTTTG 
TACCTATATC 
TCTCTGTGTG 



51 



I 

CTCGTCGCAC 
AAGGCTCAGC 
GCCGGGTTGC 
TCCGTCCAAA 
ACCAGGCCCG 
CAGCCTGCCC 
CGCTGTGCCT 
GCACCGCGCT 
GGTCAAGTCG 
GCCGCCGCCA 
GAAGCCGCGC 
GAAACAGTAC 
GACCCAGGTC 
GGAACTGGAA 
CTTCCCCATC 
TAGACCTGTG 
GTACCACCTG 
GGGTTTCCTC 
TCGACGTTGC 
ATTTCGTTCT 
CCCCTTTTTT 
TATATATACG 
TTAAAAGGCA 
ACACATGTGT 
CCGCGTCAAG 
ATGTGCTAAA 
CCAAGGAGTC 
GAAAACAGAC 
TGCTGACCGG 
ACCCACATCA 
AGACCGCCTC 
TGTTCCTATC 
ATTCGCCGCC 
TTTTGTAAGA 
ATCGGAAGCC 
AAAATTAGAA 
CGTCAGAAAC 
AGAGTAAAAT 
GTCAAAAGGT 
GGTGGGGGTT 
TAAATATTCT 
TGTACCTGTT

GTATGTCCTA ACTTATTAGA AAAATTTTAT ATCTTTTTAC ATGTGGGGGG CAGAAGGTAA 2580 

AGCATGTTTG ACTTGTGAAA ATGGGATGTC AAACAGCCAT AAGTTCCCTG GTATTCACCT 2640

TCCTGTCCAT CTGTCCCCTC CATCGGTATA CCTTTATCCC TTTGAAAGGG TGCTTGTACA 2700 

ATTTGATATA TTTTATTGAA GAGTTATCTC TTATTCTGAA TTAAATTAAG CATTTGTTTT 2760 
ATTCTGAATT AAATTAAGCA TTTGTTTTAT TGCAGTAAAG TTTGTCCA 

Seq ID NO: 68 Protein Sequence 
Protein Accession #: NP_002440.1

1 11 21 31 41 51

| | | | | |
MASPSKGNDL FSPDEEGPAV VAGPGPGLGG AAGAAEERRV KVSSLPFSVE ALMSDKKPPK 60 
ESPAVPPEGA SAGAHLRPLL LSGHRAREAH SPGPLVKPFE TASVKSGNSE DGAAWMQEPG 120 
RYSPPPRHMS PTTCTLRKHK TNRKPRTPFT TSQLLALERK FRQKQYLSIA ERAEFSSSLN 180
LTETQVKIWF QNRSAKAKRL QEAELEKLKM AAKPMLPSSF SLPFPISSPL QAASIYAASY 240
PFHRPVLPIP PVGLYATPVG YGMYHLS

Seq ID NO: 69 DNA sequence

Nucleic Acid Accession #: NM_005603.1

Coding sequence: 1..3756



1 11 21 31 41 51

| | | | | |

ATGAGTACAG AAAGAGACTC AGAAACGACA TTTGACGAGG ATTCTCAGCC TAATGACGAA 60
           ACAGTGATGA TGAAACAGAA GATGAACTTG ATGACCAGGG GTCTGCTGTT 120
GAACCAGAAC AAAACCGAGT CAACAGGGAA GCAGAGGAGA ACCGGGAGCC ATTCAGAAAA 180
GAATGTACAT GGCAAGTCAA AGCAAACGAT CGCAAGTACC ACGAACAACC TCACTTTATG 240
AACACAAAAT TCTTGTGTAT TAAGGAGAGT AAATATGCGA ATAATGCAAT TAAAACATAC 300
AAGTACAACG CATTTACCTT TATACCAATG AATCTGTTTG AGCAGTTTAA GAGAGCAGCC 360
AATTTATATT TCCTGGCTCT TCTTATCTTA CAGGCAGTTC CTCAAATCTC TACCCTGGCT 420
TGGTACACCA CACTAGTGCC CCTGCTTGTG GTGCTGGGCG TCACTGCAAT CAAAGACCTG 480
GTGGACGATG TGGCTCGCCA TAAAATGGAT AAGGAAATCA ACAATAGGAC GTGTGAAGTC 540
ATTAAGGATG GCAGGTTCAA AGTTGCTAAG TGGAAAGAAA TTCAAGTTGG AGACGTCATT 600
CGTCTGAAAA AAAATGATTT TGTTCCAGCT GACATTCTCC TGCTGTCTAG CTCTGAGCCT 660
AACAGCCTCT GCTATGTGGA AACAGCAGAA CTGGACGGAG AAACCAATTT AAAATTTAAG 720
ATGTCACTTG AAATCACAGA CCAGTACCTC CAAAGAGAAG ATACATTGGC TACATTTGAT 780
GGTTTTATTG AATGTGAAGA ACCCAATAAC CGACTAGATA AGTTTACAGG AACACTATTT 840
TGGAGAAACA CAAGTTTTCC TTTGGATGCT GATAAAATTT TGTTACGTGG CTGTGTAATT 900
AGGAACACCG ATTTCTGCCA CGGCTTAGTC ATTTTTGCAG GTGCTGACAC TAAAATAATG 960
AAGAATAGTG GGAAAACCAG ATTTAAAAGA ACTAAAATTG ATTACTTGAT GAACTACATG 1020
GTTTACACGA TCTTTGTTGT TCTTATTCTG CTTTCTGCTG GTCTTGCCAT CGGCCATGCT 1080
TATTGGGAAG CACAGGTGGG CAATTCCTCT TGGTACCTCT ATGATGGAGA AGACGATACA 1140
CCCTCCTACC GTGGATTCCT CATTTTCTGG GGCTATATCA TTGTTCTCAA CACCATGGTA 1200
CCCATCTCTC TCTATGTCAG CGTGGAAGTG ATTCGTCTTG GACAGAGTCA CTTCATCAAC 1260
TGGGACCTGC AAATGTACTA TGCTGAGAAG GACACACCCG CAAAAGCTAG AACCACCACA 1320
CTCAATGAAC AGCTCGGGCA GATCCATTAT ATCTTCTCTG ATAAGACGGG GACACTCACA 1380
CAAAATATCA TGACCTTTAA AAAGTGCTGT ATCAACGGGC AGATATATGG GGACCATCGG 1440
GATGCCTCTC AACACAACCA CAACAAAATA GAGCAAGTTG ATTTTAGCTG GAATACATAT 1500
GCTGATGGGA AGCTTGCATT TTATGACCAC TATCTTATTG AGCAAATCCA GTCAGGGAAA 1560
GAGCCAGAAG TACGACAGTT CTTCTTCTTG CTCGCAGTTT GCCACACAGT CATGGTGGAT 1620
AGGACTGATG GTCAGCTCAA CTACCAGGCA GCCTCTCCCG ATGAAGGTGC CCTGGTAAAC 1680
GCTGCCAGGA ACTTTGGCTT TGCCTTCCTC GCCAGGACCC AGAACACCAT CACCATCAGT 1740
GAACTGGGCA CTGAAAGGAC TTACAATGTT CTTGCCATTT TGGACTTCAA CAGTGACCGG 1800
AAGCGAATGT CTATCATTGT AAGAACCCCA GAAGGCAATA TCAAGCTTTA CTGTAAAGGT 1860
GCTGACACTG TTATTTATGA ACGGTTACAT CGAATGAATC CTACTAAGCA AGAAACACAG 1920
GATGCCCTGG ATATCTTTGC AAATGAAACT CTTAGAACCC TATGCCTTTG CTACAAGGAA 1980
ATTGAAGAAA AAGAATTTAC AGAATGGAAT AAAAAGTTTA TGGCTGCCAG TGTGGCCTCC 2040
ACCAACCGGG ACGAAGCTCT GGATAAAGTA TATGAGGAGA TTGAAAAAGA CTTAATTCTC 2100
CTGGGAGCTA CAGCTATTGA AGACAAGCTA CAGGATGGAG TTCCAGAAAC CATTTCAAAA 2160
CTTGCAAAAG CTGACATTAA GATCTGGGTG CTTACTGGAG ACAAAAAGGA AACTGCTGAA 2220
AATATAGGAT TTGCTTGTGA ACTTCTGACT GAAGACACCA CCATCTGCTA TGGGGAGGAT 2280
ATTAATTCTC TTCTTCATGC AAGGATGGAA AACCAGAGGA ATAGAGGTGG CGTCTACGCA 2340
AAGTTTGCAC CTCCTGTGCA GGAATCTTTT TTTCCACCCG GTGGAAACCG TGCCTTAATC 2400
ATCACTGGTT CTTGGTTGAA TGAAATTCTT CTCGAGAAAA AGACCAAGAG AAATAAGATT 2460
CTGAAGCTGA AGTTCCCAAG AACAGAAGAA GAAAGACGGA TGCGGACCCA AAGTAAAAGG 2520
AGGCTAGAAG CTAAGAAAGA GCAGCGGCAG AAAAACTTTG TGGACCTGGC CTGCGAGTGC 2580
AGCGCAGTCA TCTGCTGCCG CGTCACCCCC AAGCAGAAGG CCATGGTGGT GGACCTGGTG 2640
AAGAGGTACA AGAAAGCCAT CACGCTGGCC ATCGGAGATG GGGCCAATGA CGTGAACATG 2700
ATCAAAACTG CCCACATTGG CGTTGGAATA AGTGGACAAG AAGGAATGCA AGCTGTCATG 2760
TCGAGTGACT ATTCCTTTGC TCAGTTCCGA TATCTGCAGA GGCTACTGCT GGTGCATGGC 2820
CGATGGTCTT ACATAAGGAT GTGCAAGTTC CTACGATACT TCTTTTACAA AAACTTTGCC 2880
TTTACTTTGG TTCATTTCTG GTACTCCTTC TTCAATGGCT ACTCTGCGCA GACTGCATAC 2940
GAGGATTGGT TCATCACCCT CTACAACGTG CTGTACACCA GCCTGCCCGT GCTCCTCATG 3000
GGGCTGCTCG ACCAGGATGT GAGTGACAAA CTGAGCCTCC GATTCCCTGG GTTATACATA 3060
GTGGGACAAA GAGACTTACT ATTCAACTAT AAGAGATTCT TTGTAAGCTT GTTGCATGGG 3120
GTCCTAACAT CGATGATCCT CTTCTTCATA CCTCTTGGAG CTTATCTGCA AACCGTAGGG 3180
CAGGATGGAG AGGCACCTTC CGACTACCAG TCTTTTGCCG TCACCATTGC CTCTGCTCTT 3240
GTAATAACAG TCAATTTCCA GATTGGCTTG GATACTTCTT ATTGGACTTT TGTGAATGCT 3300

TTTTCAATTT TTGGAAGCAT TGCACTTTAT TTTGGCATCA TGTTTGACTT TCATAGTGCT 3360 

GGAATACATG TTCTCTTTCC ATCTGCATTT CAATTTACAG GCACAGCTTC AAACGCTCTG 3420 

AGACAGCCAT ACATTTGGTT AACTATCATC CTGACTGTTG CTGTGTGCTT ACTACCCGTC 3480 

GTTGCCATTC GATTCCTGTC AATGACCATC TGGCCATCAG AAAGTGATAA GATCCAGAAG 3540 

CATCGCAAGC GGTTGAAGGC GGAGGAGCAG TGGCAGCGAC GGCAGCAGGT GTTCCGCCGG 3600 

GGCGTGTCAA CGCGGCGCTC GGCCTACGCC TTCTCGCACC AGCGGGGCTA CGCGGACCTC 3660 

ATCTCCTCCG GGCGCAGCAT CCGCAAGAAG CGCTCGCCGC TTGATGCCAT CGTGGCGGAT 3720
GGCACCGCGG AGTACAGGCG CACCGGGGAC AGCTGA 



Seq ID NO: 70 Protein Sequence 
Protein Accession #: NP_005594.1

1 11 21 31 41 51 

I I I I I I 

MSTERDSETT FDEDSQPNDE VVPYSDDETE DELDDQGSAV EPEQNRVNRE AEENREPFRK 60

ECTWQVKAND RKYHEQPHFM NTKFLCIKES KYANNAIKTY KYNAFTFIPM NLFEQFKRAA 120 

NLYFLALLIL QAVPQISTLA WYTTLVPLLV VLGVTAIKDL VDDVARHKMD KEINNRTCEV 180

IKDGRFKVAK WKEIQVGDVI RLKKNDFVPA DILLLSSSEP NSLCYVETAE LDGETNLKFK 240

MSLEITDQYL QREDTLATFD GFIECEEPNN RLDKFTGTLF WRNTSFPLDA DKILLRGCVI 300 

RNTDFCHGLV IFAGADTKIM KNSGKTRFKR TKIDYLMNYM VYTIFVVLIL LSAGLAIGHA 360

YWEAQVGNSS WYLYDGEDDT PSYRGFLIFW GYIIVLNTMV PISLYVSVEV IRLGQSHFIN 420 

WDLQMYYAEK DTPAKARTTT LNEQLGQIHY IFSDKTGTLT QNIMTFKKCC INGQIYGDHR 480

DASQHNHNKI EQVDFSWNTY ADGKLAFYDH YLIEQIQSGK EPEVRQFFFL LAVCHTVMVD 540 

RTDGQLNYQA ASPDEGALVN AARNFGFAFL ARTQNTITIS ELGTERTYNV LAILDFNSDR 600 

KRMSIIVRTP EGNIKLYCKG ADTVIYERLH RMNPTKQETQ DALDIFANET LRTLCLCYKE 660 

IEEKEFTEWN KKFMAASVAS TNRDEALDKV YEEIEKDLIL LGATAIEDKL QDGVPETISK 720

LAKADIKIWV LTGDKKETAE NIGFACELLT EDTTICYGED INSLLHARME NQRNRGGVYA 780 

KFAPPVQESF FPPGGNRALI ITGSWLNEIL LEKKTKRNKI LKLKFPRTEE ERRMRTQSKR 840 

RLEAKKEQRQ KNFVDLACEC SAVICCRVTP KQKAMVVDLV KRYKKAITLA IGDGANDVNM 900

IKTAHIGVGI SGQEGMQAVM SSDYSFAQFR YLQRLLLVHG RWSYIRMCKF LRYFFYKNFA 960 

FTLVHFWYSF FNGYSAQTAY EDWFITLYNV LYTSLPVLLM GLLDQDVSDK LSLRFPGLYI 1020 

VGQRDLLFNY KRFFVSLLHG VLTSMILFFI PLGAYLQTVG QDGEAPSDYQ SFAVTIASAL 1080 

VITVNFQIGL DTSYWTFVNA FSIFGSIALY FGIMFDFHSA GIHVLFPSAF QFTGTASNAL 1140 

RQPYIWLTII LTVAVCLLPV VAIRFLSMTI WPSESDKIQK HRKRLKAEEQ WQRRQQVFRR 1200 
GVSTRRSAYA FSHQRGYADL ISSGRSIRKK RSPLDAIVAD GTAEYRRTGD S 

Seq ID NO: 71 DNA sequence 

Nucleic Acid Accession #: NM_138784.1

Coding sequence: 334-816 

1 11 21 31 41 51 

GGGGATACGG AAAACAGAAT AGAAGCACAG AACAAGGACT ATTGTATATA AGAGAAGCAG 60 

GTGTTAATGT AAGTTGCTGT GGAATGACTG TTTTCAAGAA GAGAGAAGGA GCCAATATAT 120 

CTACTTTCAT TAAGAAGCTG AAATTAAATT TGGATTTCCT TCTTTCCAAA GCTGAGACAC 180 

GATGGCCAAA AAGCGATTGC TGTGATTGGA GCTGGAATTA GCGGACTGGG GGCCATCAAG 240 

TGCTGCCTGG ATGAAGATCT GGAGCCCACC TGCTTTGAAA GAAATGATGA TATTGGACAT 300 

CTCTGGAAAT TTCAAAAAAA TACTTCAGAG AAAATGCCTA GTATCTACAA ATCTGTGACC 360 

ATCAATACTT CCAAGGAGAT GATGTGCTTC AGTGACTTCC CTGTCCCTGA TCATTTTCCC 420 

AACTACATGC ACAACTCCAA ACTCATGGAC TACTTCGGGA TGTATGCCAC ACACTTTGGC 480 

CTCCTGAATT ACATTCGTTT TAAGACTGAA GTGCAAAGTG TGAGGAAGCA CCCAGATTTT 540 

TCTATCAATG GACAATGGGA TGTTGTTGTG GAGACTGAAG AGAAACAAGA GACTTTGGTC 600 

TTTGATGGGG TCTTAGTTTG CAGTGGACAC CACACAGATC CCTACTTACC ACTTCAGTCC 660 

TTCCCAGGCA TTGAGAAATT TGAAGGCTGT TATTTCCATA GTCGGGAATA CAAAAGTCCC 720 

GAGGACTTTT CAGGGAAAAG AATCATAGTG ATCGGCATTG GAAATTCTGG AGTGGATATT 780 

GCGGTGGAGC TCAGTCGTGT AGCAAAACAG ATTTGACCAG TAGGAACTGA GATTCAACGT 840 

CCCCAAGACT GCAAAAGTAC CTACAATCAA ACAAATTAAT ATATTTCCTA ATTGGATCAG 900 

CATGCAGACA AGTTTACAAA TATATTACAG AAAGATTTGA AGAGGGATTC TGGTTACGTT 960 

CAGTTAAGCA GGACACTAAT CCTGAACAAT CTGATGTATT AAAGTTTCTA ATGTTTGTAT 1020 

CTTTGGTTTG ATGCATACAA TAGAGTGGAA GTCTGTAGTA GTTAATATGA TGATAATTCG 1080 

CTTCTAAATT TAAATCCTTA TTTATATATG AAGCATAGCT ATGTCAATGT CCAGGGAAGC 1140 

ATTTTTAAAT AGTAAATCAG AAACATGTAA ACTAGTGAAT ACAATTTTCT ATTTTTCATT 1200 

CAGAAATTTG TTGTCATAGA TGCAGCCTTT ATCTGGAAAT CAGTGGGCAG CTCTGGTCAA 1260 

GCACAGCATG GCGACCACTG TGTAGTATTT TACCACCTAA TCCTTAGTAG CCAACACTGT 1320 

TAAATCTGCA TAATGGGTAG CATATGAATA AAATAATATC TTGTAAAAAA AAAAAAAAAA 1380 
AAAAAAAAAA AAAA 

Seq ID NO: 72 Protein sequence: 
Protein Accession #: NP_620139.1 

1 11 21 31 41 51

MPSIYKSVTI NTSKEMMCFS DFPVPDHFPN YMHNSKLMDY FGMYATHFGL LNYIRFKTEV 60
QSVRKHPDFS INGQWDVVVE TEEKQETLVF DGVLVCSGHH TDPYLPLQSF PGIEKFEGCY 120
FHSREYKSPE DFSGKRIIVI GIGNSGVDIA VELSRVAKQI

Seq ID NO: 73 DNA sequence
Nucleic Acid Accession #: NM_003661.1
Coding sequence: 1..1152 

1 11 21 31 41 51

I I I I I I 

ATGAGTGCAC TTTTCCTTGG TGTGGGAGTG AGGGCAGAGG AAGCTGGAGC GAGGGTGCAA 60 

CAAAACGTTC CAAGTGGGAC AGATACTGGA GATCCTCAAA GTAAGCCCCT CGGTGACTGG 120 

GCTGCTGGCA CCATGGACCC AGAGAGCAGT ATCTTTATTG AGGATGCCAT TAAGTATTTC 180 

AAGGAAAAAG TGAGCACACA GAATCTGCTA CTCCTGCTGA CTGATAATGA GGCCTGGAAC 240 

GGATTCGTGG CTGCTGCTGA ACTGCCCAGG AATGAGGCAG ATGAGCTCCG TAAAGCTCTG 300 

GACAACCTTG CAAGACAAAT GATCATGAAA GACAAAAACT GGCACGATAA AGGCCAGCAG 360 

TACAGAAACT GGTTTCTGAA AGAGTTTCCT CGGTTGAAAA GTGAGCTTGA GGATAACATA 420 

AGAAGGCTCC GTGCCCTTGC AGATGGGGTT CAGAAGGTCC ACAAAGGCAC CACCATCGCC 480 

AATGTGGTGT CTGGCTCTCT CAGCATTTCC TCTGGCATCC TGACCCTCGT CGGCATGGGT 540

CTGGCACCCT TCACAGAGGG AGGCAGCCTT GTACTCTTGG AACCTGGGAT GGAGTTGGGA 600 

ATCACAGCCG CTTTGACCGG GATTACCAGC AGTACCATGG ACTACGGAAA GAAGTGGTGG 660 

ACACAAGCCC AAGCCCACGA CCTGGTCATC AAAAGCCTTG ACAAATTGAA GGAGGTGAGG 720 

GAGTTTTTGG GTGAGAACAT ATCCAACTTT CTTTCCTTAG CTGGCAATAC TTACCAACTC 780 

ACACGAGGCA TTGGGAAGGA CATCCGTGCC CTCAGACGAG CCAGAGCCAA TCTTCAGTCA 840 

GTACCGCATG CCTCAGCCTC ACGCCCCCGG GTCACTGAGC CAATCTCAGC TGAAAGCGGT 900 

GAACAGGTGG AGAGGGTTAA TGAACCCAGC ATCCTGGAAA TGAGCAGAGG AGTCAAGCTC 960 

ACGGATGTGG CCCCTGTAAG CTTCTTTCTT GTGCTGGATG TAGTCTACCT CGTGTACGAA 1020 

TCAAAGCACT TACATGAGGG GGCAAAGTCA GAGACAGCTG AGGAGCTGAA GAAGGTGGCT 1080 

CAGGAGCTGG AGGAGAAGCT AAACATTCTC AACAATAATT ATAAGATTCT GCAGGCGGAC 1140
CAAGAACTGT GA

Seq ID NO: 74 Protein Sequence
Protein Accession #: NP_003652.1

1 11 21 31 41 51 

I I I I I I 

MSALFLGVGV RAEEAGARVQ QNVPSGTDTG DPQSKPLGDW AAGTMDPESS IFIEDAIKYF 60

KEKVSTQNLL LLLTDNEAWN GFVAAAELPR NEADELRKAL DNLARQMIMK DKNWHDKGQQ 120 

YRNWFLKEFP RLKSELEDNI RRLRALADGV QKVHKGTTIA NVVSGSLSIS SGILTLVGMG 180

LAPFTEGGSL VLLEPGMELG ITAALTGITS STMDYGKKWW TQAQAHDLVI KSLDKLKEVR 240 

EFLGENISNF LSLAGNTYQL TRGIGKDIRA LRRARANLQS VPHASASRPR VTEPISAESG 300 

EQVERVNEPS ILEMSRGVKL TDVAPVSFFL VLDVVYLVYE SKHLHEGAKS ETAEELKKVA 360
QELEEKLNIL NNNYKILQAD QEL 

Seq ID NO: 75 DNA sequence 

Nucleic Acid Accession #: NM_014452.1 

Coding sequence: 1..1968

1 11 21 31 41 51

| | | | | |

ATGGGGACCT CTCCGAGCAG CAGCACCGCC CTCGCCTCCT GCAGCCGCAT CGCCCGCCGA 60 

GCCACAGCCA CGATGATCGC GGGCTCCCTT CTCCTGCTTG GATTCCTTAG CACCACCACA 120 

GCTCAGCCAG AACAGAAGGC CTCGAATCTC ATTGGCACAT ACCGCCATGT TGACCGTGCC 180

ACCGGCCAGG TGCTAACCTG TGACAAGTGT CCAGCAGGAA CCTATGTCTC TGAGCATTGT 240

ACCAACACAA GCCTGCGCGT CTGCAGCAGT TGCCCTGTGG GGACCTTTAC CAGGCATGAG 300 

AATGGCATAG AGAAATGCCA TGACTGTAGT CAGCCATGCC CATGGCCAAT GATTGAGAAA 360 

TTACCTTGTG CTGCCTTGAC TGACCGAGAA TGCACTTGCC CACCTGGCAT GTTCCAGTCT 420 

AACGCTACCT GTGCCCCCCA TACGGTGTGT CCTGTGGGTT GGGGTGTGCG GAAGAAAGGG 480 

ACAGAGACTG AGGATGTGCG GTGTAAGCAG TGTGCTCGGG GTACCTTCTC AGATGTGCCT 540

TCTAGTGTGA TGAAATGCAA AGCATACACA GACTGTCTGA GTCAGAACCT GGTGGTGATC 600 

AAGCCGGGGA CCAAGGAGAC AGACAACGTC TGTGGCACAC TCCCGTCCTT CTCCAGCTCC 660 

ACCTCACCTT CCCCTGGCAC AGCCATCTTT CCACGCCCTG AGCACATGGA AACCCATGAA 720 

GTCCCTTCCT CCACTTATGT TCCCAAAGGC ATGAACTCAA CAGAATCCAA CTCTTCTGCC 780 

TCTGTTAGAC CAAAGGTACT GAGTAGCATC CAGGAAGGGA CAGTCCCTGA CAACACAAGC 840

TCAGCAAGGG GGAAGGAAGA CGTGAACAAG ACCCTCCCAA ACCTTCAGGT AGTCAACCAC 900 

CAGCAAGGCC CCCACCACAG ACACATCCTG AAGCTGCTGC CGTCCATGGA GGCCACTGGG 960 

GGCGAGAAGT CCAGCACGCC CATCAAGGGC CCCAAGAGGG GACATCCTAG ACAGAACCTA 1020 

CACAAGCATT TTGACATCAA TGAGCATTTG CCCTGGATGA TTGTGCTTTT CCTGCTGCTG 1080 

GTGCTTGTGG TGATTGTGGT GTGCAGTATC CGGAAAAGCT CGAGGACTCT GAAAAAGGGG 1140

CCCCGGCAGG ATCCCAGTGC CATTGTGGAA AAGGCAGGGC TGAAGAAATC CATGACTCCA 1200

ACCCAGAACC GGGAGAAATG GATCTACTAC TGCAATGGCC ATGGTATCGA TATCCTGAAG 1260

CTTGTAGCAG CCCAAGTGGG AAGCCAGTGG AAAGATATCT ATCAGTTTCT TTGCAATGCC 1320 

AGTGAGAGGG AGGTTGCTGC TTTCTCCAAT GGGTACACAG CCGACCACGA GCGGGCCTAC 1380

GCAGCTCTGC AGCACTGGAC CATCCGGGGC CCCGAGGCCA GCCTCGCCCA GCTAATTAGC 1440

GCCCTGCGCC AGCACCGGAG AAACGATGTT GTGGAGAAGA TTCGTGGGCT GATGGAAGAC 1500 

ACCACCCAGC TGGAAACTGA CAAACTAGCT CTCCCGATGA GCCCCAGCCC GCTTAGCCCG 1560 

AGCCCCATCC CCAGCCCCAA CGCGAAACTT GAGAATTCCG CTCTCCTGAC GGTGGAGCCT 1620 

TCCCCACAGG ACAAGAACAA GGGCTTCTTC GTGGATGAGT CGGAGCCCCT TCTCCGCTGT 1680 

GACTCTACAT CCAGCGGCTC CTCCGCGCTG AGCAGGAACG GTTCCTTTAT TACCAAAGAA 1740

AAGAAGGACA CAGTGTTGCG GCAGGTACGC CTGGACCCCT GTGACTTGCA GCCTATCTTT 1800 

GATGACATGC TCCACTTTCT AAATCCTGAG GAGCTGCGGG TGATTGAAGA GATTCCCCAG 1860 

GCTGAGGACA AACTAGACCG GCTATTCGAA ATTATTGGAG TCAAGAGCCA GGAAGCCAGC 1920 

CAGACCCTCC TGGACTCTGT TTATAGCCAT CTTCCTGACC TGCTGTAG



Seq ID NO: 76 Protein Sequence 
Protein Accession #: NP_055267.1

1 11 21 31 41 51

| | | | | |

MGTSPSSSTA LASCSRIARR ATATMIAGSL LLLGFLSTTT AQPEQKASNL IGTYRHVDRA 60
TGQVLTCDKC PAGTYVSEHC TNTSLRVCSS CPVGTFTRHE NGIEKCHDCS QPCPWPMIEK 120
LPCAALTDRE CTCPPGMFQS NATCAPHTVC PVGWGVRKKG TETEDVRCKQ CARGTFSDVP 180
SSVMKCKAYT DCLSQNLVVI KPGTKETDNV CGTLPSFSSS TSPSPGTAIF PRPEHMETHE 240
VPSSTYVPKG MNSTESNSSA SVRPKVLSSI QEGTVPDNTS SARGKEDVNK TLPNLQVVNH 300
QQGPHHRHIL KLLPSMEATG GEKSSTPIKG PKRGHPRQNL HKHFDINEHL PWMIVLFLLL 360
VLVVIVVCSI RKSSRTLKKG PRQDPSAIVE KAGLKKSMTP TQNREKWIYY CNGHGIDILK 420
LVAAQVGSQW KDIYQFLCNA SEREVAAFSN GYTADHERAY AALQHWTIRG PEASLAQLIS 480
ALRQHRRNDV VEKIRGLMED TTQLETDKLA LPMSPSPLSP SPIPSPNAKL ENSALLTVEP 540
SPQDKNKGFF VDESEPLLRC DSTSSGSSAL SRNGSFITKE KKDTVLRQVR LDPCDLQPIF 600
DDMLHFLNPE ELRVIEEIPQ AEDKLDRLFE IIGVKSQEAS QTLLDSVYSH LPDLL

Seq ID NO: 77 DNA sequence

Nucleic Acid Accession #: NM_003105.3 

Coding sequence: 123..6767



1 11 21 31 41 51

| | | | | |

GGCGCGGGCG GCCTGGAGCC CCGGGAGCGG CGCGCGCGGT CCCGGCCCAG CCGGCTCTCC 60
TGGCCTCGCG CTGCACATTC TCTCCTGGCG GCGGCGCCAC CTGCAGTAGC GTTCGCCCGA 120
ACATGGCGAC ACGGAGCAGC AGGAGGGAGT CGCGACTCCC GTTCCTATTC ACCCTGGTCG 180
CACTGCTGCC GCCCGGAGCT CTCTGCGAAG TCTGGACGCA GAGGCTGCAC GGCGGCAGCG 240
CGCCCTTGCC CCAGGACCGG GGCTTCCTCG TGGTGCAGGG CGACCCGCGC GAGCTGCGGC 300
TGTGGGCGCG CGGGGATGCC AGGGGGGCGA GCCGCGCGGA CGAGAAGCCG CTCCGGAGGA 360
AACGGAGCGC TGCCCTGCAG CCCGAGCCCA TCAAGGTGTA CGGACAGGTT AGTCTGAATG 420
ATTCCCACAA TCAGATGGTG GTGCACTGGG CTGGAGAGAA AAGCAACGTG ATCGTGGCCT 480
TGGCCCGAGA TAGCCTGGCA TTGGCGAGGC CCAAGAGCAG TGATGTGTAC GTGTCTTACG 540
ACTATGGAAA ATCATTCAAG AAAATTTCAG ACAAGTTAAA CTTTGGCTTG GGAAATAGGA 600
GTGAAGCTGT TATCGCCCAG TTCTACCACA GCCCTGCGGA CAACAAGCGG TACATCTTTG 660
CAGACGCTTA TGCCCAGTAC CTCTGGATCA CGTTTGACTT CTGCAACACT CTTCAAGGCT 720
TTTCCATCCC ATTTCGGGCA GCTGATCTCC TCCTACACAG TAAGGCCTCC AACCTTCTCT 780
TGGGCTTTGA CAGGTCCCAC CCCAACAAGC AGCTGTGGAA GTCAGATGAC TTTGGCCAGA 840
CCTGGATCAT GATTCAGGAA CATGTCAAGT CCTTTTCTTG GGGAATTGAT CCCTATGACA 900
AACCAAATAC CATCTACATT GAACGACACG AACCCTCTGG CTACTCCACT GTCTTCCGAA 960
GTACAGATTT CTTCCAGTCC CGGGAAAACC AGGAAGTGAT CCTTGAGGAA GTGAGAGATT 1020
TTCAGCTTCG GGACAAGTAC ATGTTTGCTA CAAAGGTGGT GCATCTCTTG GGCAGTGAAC 1080
AGCAGTCTTC TGTCCAGCTC TGGGTCTCCT TTGGCCGGAA GCCCATGAGA GCAGCCCAGT 1140
TTGTCACAAG ACATCCTATT AATGAATATT ACATCGCAGA TGCCTCCGAG GACCAGGTGT 1200
TTGTGTGTGT CAGCCACAGT AACAACCGCA CCAATTTATA CATCTCAGAG GCAGAGGGGC 1260
TGAAGTTCTC CCTGTCCTTG GAGAACGTGC TCTATTACAG CCCAGGAGGG GCCGGCAGTG 1320
ACACCTTGGT GAGGTATTTT GCAAATGAAC CATTTGCTGA CTTCCACCGA GTGGAAGGAT 1380
TGCAAGGAGT CTACATTGCT ACTCTGATTA ATGGTTCTAT GAATGAGGAG AACATGAGAT 1440
CGGTCATCAC CTTTGACAAA GGGGGAACCT GGGAGTTTCT TCAGGCTCCA GCCTTCACGG 1500
GATATGGAGA GAAAATCAAT TGTGAGCTTT CCCAGGGCTG TTCCCTTCAT CTGGCTCAGC 1560
GCCTCAGTCA GCTCCTCAAC CTCCAGCTCC GGAGAATGCC CATCCTGTCC AAGGAGTCGG 1620
CTCCAGGCCT CATCATCGCC ACTGGCTCAG TGGGAAAGAA CTTGGCTAGC AAGACAAACG 1680
TGTACATCTC TAGCAGTGCT GGAGCCAGGT GGCGAGAGGC ACTTCCTGGA CCTCACTACT 1740
ACACATGGGG AGACCACGGC GGAATCATCA CGGCCATTGC CCAGGGCATG GAAACCAACG 1800
AGCTAAAATA CAGTACCAAT GAAGGGGAGA CCTGGAAAAC ATTCATCTTC TCTGAGAAGC 1860
CAGTGTTTGT GTATGGCCTC CTCACAGAAC CTGGGGAGAA GAGCACTGTC TTCACCATCT 1920
TTGGCTCGAA CAAAGAGAAT GTCCACAGCT GGCTGATCCT CCAGGTCAAT GCCACGGATG 1980
CCTTGGGAGT TCCCTGCACA GAGAATGACT ACAAGCTGTG GTCACCATCT GATGAGCGGG 2040
GGAATGAGTG TTTGCTGGGA CACAAGACTG TTTTCAAACG GCGGACCCCC CATGCCACAT 2100
GCTTCAATGG AGAGGACTTT GACAGGCCGG TGGTCGTGTC CAACTGCTCC TGCACCCGGG 2160
AGGACTATGA GTGTGACTTC GGTTTCAAGA TGAGTGAAGA TTTGTCATTA GAGGTTTGTG 2220
TTCCAGATCC GGAATTTTCT GGAAAGTCAT ACTCCCCTCC TGTGCCTTGC CCTGTGGGTT 2280
CTACTTACAG GAGAACGAGA GGCTACCGGA AGATTTCTGG GGACACTTGT AGCGGAGGAG 2340
ATGTTGAAGC GCGACTGGAA GGAGAGCTGG TCCCCTGTCC CCTGGCAGAA GAGAACGAGT 2400
TCATTCTGTA TGCTGTGAGG AAATCCATCT ACCGCTATGA CCTGGCCTCG GGAGCCACCG 2460
AGCAGTTGCC TCTCACCGGG CTACGGGCAG CAGTGGCCCT GGACTTTGAC TATGAGCACA 2520
ACTGTTTGTA TTGGTCCGAC CTGGCCTTGG ACGTCATCCA GCGCCTCTGT TTGAATGGAA 2580
GCACAGGGCA AGAGGTGATC ATCAATTCTG GCCTGGAGAC AGTAGAAGCT TTGGCTTTTG 2640
AACCCCTCAG CCAGCTGCTT TACTGGGTAG ATGCAGGCTT CAAAAAGATT GAGGTAGCTA 2700
ATCCAGATGG CGACTTCCGA CTCACAATCG TCAATTCCTC TGTGCTTGAT CGTCCCAGGG 2760
CTCTGGTCCT CGTGCCCCAA GAGGGGGTGA TGTTCTGGAC AGACTGGGGA GACCTGAAGC 2820
CTGGGATTTA TCGGAGCAAT ATGGATGGTT CTGCTGCCTA TCACCTGGTG TCTGAGGATG 2880
TGAAGTGGCC CAATGGCATC TCTGTGGACG ACCAGTGGAT TTACTGGACG GATGCCTACC 2940
TGGAGTGCAT AGAGCGGATC ACGTTCAGTG GCCAGCAGCG CTCTGTCATT CTGGACAACC 3000
TCCCGCACCC CTATGCCATT GCTGTCTTTA AGAATGAAAT CTACTGGGAT GACTGGTCAC 3060
AGCTCAGCAT ATTCCGAGCT TCCAAATACA GTGGGTCCCA GATGGAGATT CTGGCAAACC 3120
AGCTCACGGG GCTCATGGAC ATGAAGATTT TCTACAAGGG GAAGAACACT GGAAGCAATG 3180
CCTGTGTGCC CAGGCCATGC AGCCTGCTGT GCCTGCCCAA GGCCAACAAC AGTAGAAGCT 3240

GCAGGTGTCC AGAGGATGTG TCCAGCAGTG TGCTTCCATC AGGGGACCTG ATGTGTGACT 3300 

GCCCTCAGGG CTATCAGCTC AAGAACAATA CCTGTGTCAA AGAAGAGAAC ACCTGTCTTC 3360 

GCAACCAGTA TCGCTGCAGC AACGGGAACT GTATCAACAG CATTTGGTGG TGTGACTTTG 3420 

ACAACGACTG TGGAGACATG AGCGATGAGA GAAACTGCCC TACCACCATC TGTGACCTGG 3480

ACACCCAGTT TCGTTGCCAG GAGTCTGGGA CTTGTATCCC ACTGTCCTAT AAATGTGACC 3540 

TTGAGGATGA CTGTGGAGAC AACAGTGATG AAAGTCATTG TGAAATGCAC CAGTGCCGGA 3600 

GTGACGAGTA CAACTGCAGT TCCGGCATGT GCATCCGCTC CTCCTGGGTA TGTGACGGGG 3660 

ACAACGACTG CAGGGACTGG TCTGATGAAG CCAACTGTAC CGCCATCTAT CACACCTGTG 3720 

AGGCCTCCAA CTTCCAGTGC CGAAACGGGC ACTGCATCCC CCAGCGGTGG GCGTGTGACG 3780

GGGATACGGA CTGCCAGGAT GGTTCCGATG AGGATCCAGT CAACTGTGAG AAGAAGTGCA 3840

ATGGATTCCG CTGCCCAAAC GGCACTTGCA TCCCATCCAG CAAACATTGT GATGGTCTGC 3900 

GTGATTGCTC TGATGGCTCC GATGAACAGC ACTGCGAGCC CCTCTGTACG CACTTCATGG 3960

ACTTTGTGTG TAAGAACCGC CAGCAGTGCC TGTTCCACTC CATGGTCTGT GACGGAATCA 4020

TCCAGTGCCG CGACGGGTCC GATGAGGATG CGGCGTTTGC AGGATGCTCC CAAGATCCTG 4080

AGTTCCACAA GGTATGTGAT GAGTTCGGTT TCCAGTGTCA GAATGGAGTG TGCATCAGTT 4140 

TGATTTGGAA GTGCGACGGG ATGGATGATT GCGGCGATTA TTCTGATGAA GCCAACTGCG 4200 

AAAACCCCAC AGAAGCCCCA AACTGCTCCC GCTACTTCCA GTTTCGGTGT GAGAATGGCC 4260 

ACTGCATCCC CAACAGATGG AAATGTGACA GGGAGAACGA CTGTGGGGAC TGGTCTGATG 4320 

AGAAGGATTG TGGAGATTCA CATATTCTTC CCTTCTCGAC TCCTGGGCCC TCCACGTGTC 4380 

TGCCCAATTA CTACCGCTGC AGCAGTGGGA CCTGCGTGAT GGACACCTGG GTGTGCGACG 4440 

GGTACCGAGA TTGTGCAGAT GGCTCTGACG AGGAAGCCTG CCCCTTGCTT GCAAACGTCA 4500 

CTGCTGCCTC CACTCCCACC CAACTTGGGC GATGTGACCG ATTTGAGTTC GAATGCCACC 4560 

AACCGAAGAC GTGTATTCCC AACTGGAAGC GCTGTGACGG CCACCAAGAT TGCCAGGATG 4620 

GCCGGGACGA GGCCAATTGC CCCACACACA GCACCTTGAC TTGCATGAGC AGGGAGTTCC 4680

AGTGCGAGGA CGGGGAGGCC TGCATTGTGC TCTCGGAGCG CTGCGACGGC TTCCTGGACT 4740 

GCTCGGACGA GAGCGATGAA AAGGCCTGCA GTGATGAGTT GACTGTGTAC AAAGTACAGA 4800 

ATCTTCAGTG GACAGCTGAC TTCTCTGGGG ATGTGACTTT GACCTGGATG AGGCCCAAAA 4860 

AAATGCCCTC TGCTTCTTGT GTATATAATG TCTACTACAG GGTGGTTGGA GAGAGCATAT 4920

GGAAGACTCT GGAGACCCAC AGCAATAAGA CAAACACTGT ATTAAAAGTC TTGAAACCAG 4980

ATACCACGTA TCAGGTTAAA GTACAGGTTC AGTGTCTCAG CAAGGCACAC AACACCAATG 5040 

ACTTTGTGAC CCTGAGGACC CCAGAGGGAT TGCCAGATGC CCCTCGAAAT CTCCAGCTGT 5100 

CACTCCCCAG GGAAGCAGAA GGTGTGATTG TAGGCCACTG GGCTCCTCCC ATCCACACCC 5160 

ATGGCCTCAT CCGTGAGTAC ATTGTAGAAT ACAGCAGGAG TGGTTCCAAG ATGTGGGCCT 5220 

CCCAGAGGGC TGCTAGTAAC TTTACAGAAA TCAAGAACTT ATTGGTCAAC ACTCTATACA 5280

CCGTCAGAGT GGCTGCGGTG ACTAGTCGTG GAATAGGAAA CTGGAGCGAT TCTAAATCCA 5340 

TTACCACCAT AAAAGGAAAA GTGATCCCAC CACCAGATAT CCACATTGAC AGCTATGGTG 5400 

AAAATTATCT AAGCTTCACC CTGACCATGG AGAGTGATAT CAAGGTGAAT GGCTATGTGG 5460 

TGAACCTTTT CTGGGCATTT GACACCCACA AGCAAGAGAG GAGAACTTTG AACTTCCGAG 5520 

GAAGCATATT GTCACACAAA GTTGGCAATC TGACAGCTCA TACATCCTAT GAGATTTCTG 5580 

CCTGGGCCAA GACTGACTTG GGGGATAGCC CTCTGGCATT TGAGCATGTT ATGACCAGAG 5640 

GGGTTCGCCC ACCTGCACCT AGCCTCAAGG CCAAAGCCAT CAACCAGACT GCAGTGGAAT 5700 

GTACCTGGAC CGGCCCCCGG AATGTGGTTT ATGGTATTTT CTATGCCACG TCCTTTCTTG 5760 

ACCTCTATCG CAACCCGAAG AGCTTGACTA CTTCACTCCA CAACAAGACG GTCATTGTCA 5820 

GTAAGGATGA GCAGTATTTG TTTCTGGTCC GTGTAGTGGT ACCCTACCAG GGGCCATCCT 5880 

CTGACTACGT TGTAGTGAAG ATGATCCCGG ACAGCAGGCT TCCACCCCGT CACCTGCATG 5940 

TGGTTCATAC GGGCAAAACC TCCGTGGTCA TCAAGTGGGA ATCACCGTAT GACTCTCCTG 6000 

ACCAGGACTT GTTGTATGCA ATTGCAGTCA AAGATCTCAT AAGAAAGACT GACAGGAGCT 6060 

ACAAAGTAAA ATCCCGTAAC AGCACTGTGG AATACACCCT TAACAAGTTG GAGCCTGGCG 6120 

GGAAATACCA CATCATTGTC CAACTGGGGA ACATGAGCAA AGATTCCAGC ATAAAAATTA 6180 

CCACAGTTTC ATTATCAGCA CCTGATGCCT TAAAAATCAT AACAGAAAAT GATCATGTTC 6240 

TTCTGTTTTG GAAAAGCCTG GCTTTAAAGG AAAAGCATTT TAATGAAAGC AGGGGCTATG 6300 

AGATACACAT GTTTGATAGT GCCATGAATA TCACAGCTTA CCTTGGGAAT ACTACTGACA 6360 

ATTTCTTTAA AATTTCCAAC CTGAAGATGG GTCATAATTA CACGTTCACC GTCCAAGCAA 6420 

GATGCCTTTT TGGCAACCAG ATCTGTGGGG AGCCTGCCAT CCTGCTGTAC GATGAGCTGG 6480

GGTCTGGTGC AGATGCATCT GCAACGCAGG CTGCCAGATC TACGGATGTT GCTGCTGTGG 6540 

TGGTGCCCAT CTTATTCCTG ATACTGCTGA GCCTGGGGGT GGGGTTTGCC ATCCTGTACA 6600 

CGAAGCACCG GAGGCTGCAG AGCAGCTTCA CCGCCTTCGC CAACAGCCAC TACAGCTCCA 6660 

GGCTGGGGTC CGCAATCTTC TCCTCTGGGG ATGACCTGGG GGAAGATGAT GAAGATGCCC 6720 

CTATGATAAC TGGATTTTCA GATGACGTCC CCATGGTGAT AGCCTGAAAG AGCTTTCCTC 6780 

ACTAGAAACC AAATGGTGTA AATATTTTAT TTGATAAAGA TAGTTGATGG TTTATTTTAA 6840 



AAGATGCACT TTGAGTTGCA ATATGTTATT TTTATATGGG CCAAAAACAA AAAACAAAAA 6900 
AAAAAA 

Seq ID NO: 78 Protein Sequence 
Protein Accession #: NP_003096.1

1 11 21 31 41 51

| | | | | |

MATRSSRRES RLPFLFTLVA LLPPGALCEV WTQRLHGGSA PLPQDRGFLV VQGDPRELRL 60

WARGDARGAS RADEKPLRRK RSAALQPEPI KVYGQVSLND SHNQMVVHWA GEKSNVIVAL 120

ARDSLALARP KSSDVYVSYD YGKSFKKISD KLNFGLGNRS EAVIAQFYHS PADNKRYIFA 180

DAYAQYLWIT FDFCNTLQGF SIPFRAADLL LHSKASNLLL GFDRSHPNKQ LWKSDDFGQT 240 

WIMIQEHVKS FSWGIDPYDK PNTIYIERHE PSGYSTVFRS TDFFQSRENQ EVILEEVRDF 300 

QLRDKYMFAT KVVHLLGSEQ QSSVQLWVSF GRKPMRAAQF VTRHPINEYY IADASEDQVF 360

VCVSHSNNRT NLYISEAEGL KFSLSLENVL YYSPGGAGSD TLVRYFANEP FADFHRVEGL 420

QGVYIATLIN GSMNEENMRS VITFDKGGTW EFLQAPAFTG YGEKINCELS QGCSLHLAQR 480 

LSQLLNLQLR RMPILSKESA PGLIIATGSV GKNLASKTNV YISSSAGARW REALPGPHYY 540

TWGDHGGIIT AIAQGMETNE LKYSTNEGET WKTFIFSEKP VFVYGLLTEP GEKSTVFTIF 600

GSNKENVHSW LILQVNATDA LGVPCTENDY KLWSPSDERG NECLLGHKTV FKRRTPHATC 660

FNGEDFDRPV VVSNCSCTRE DYECDFGFKM SEDLSLEVCV PDPEFSGKSY SPPVPCPVGS 720

TYRRTRGYRK ISGDTCSGGD VEARLEGELV PCPLAEENEF ILYAVRKSIY RYDLASGATE 780 

QLPLTGLRAA VALDFDYEHN CLYWSDLALD VIQRLCLNGS TGQEVIINSG LETVEALAFE 840 

PLSQLLYWVD AGFKKIEVAN PDGDFRLTIV NSSVLDRPRA LVLVPQEGVM FWTDWGDLKP 900 

GIYRSNMDGS AAYHLVSEDV KWPNGISVDD QWIYWTDAYL ECIERITPSG QQRSVILDNL 960 

PHPYAIAVFK NEIYWDDWSQ LSIFRASKYS GSQMEILANQ LTGLMDMKIF YKGKNTGSNA 1020 

CVPRPCSLLC LPKANNSRSC RCPEDVSSSV LPSGDLMCDC PQGYQLKNNT CVKEENTCLR 1080 

NQYRCSNGNC INSIWWCDFD NDCGDMSDER NCPTTICDLD TQFRCQESGT CIPLSYKCDL 1140

EDDCGDNSDE SHCEMHQCRS DEYNCSSGMC IRSSWVCDGD NDCRDWSDEA NCTAIYHTCE 1200 

ASNFQCRNGH CIPQRWACDG DTDCQDGSDE DPVNCEKKCN GFRCPNGTCI PSSKHCDGLR 1260 

DCSDGSDEQH CEPLCTHFMD FVCKNRQQCL FHSMVCDGII QCRDGSDEDA AFAGCSQDPE 1320 

FHKVCDEFGF QCQNGVCISL IWKCDGMDDC GDYSDEANCE NPTEAPNCSR YFQFRCENGH 1380

CIPNRWKCDR ENDCGDWSDE KDCGDSHILP FSTPGPSTCL PNYYRCSSGT CVMDTWVCDG 1440

YRDCADGSDE EACPLLANVT AASTPTQLGR CDRFEFECHQ PKTCIPNWKR CDGHQDCQDG 1500

RDEANCPTHS TLTCMSREFQ CEDGEACIVL SERCDGFLDC SDESDEKACS DELTVYKVQN 1560 

LQWTADFSGD VTLTWMRPKK MPSASCVYNV YYRVVGESIW KTLETHSNKT NTVLKVLKPD 1620

TTYQVKVQVQ CLSKAHNTND FVTLRTPEGL PDAPRNLQLS LPREAEGVIV GHWAPPIHTH 1680 

GLIREYIVEY SRSGSKMWAS QRAASNFTEI KNLLVNTLYT VRVAAVTSRG IGNWSDSKSI 1740 

TTIKGKVIPP PDIHIDSYGE NYLSFTLTME SDIKVNGYVV NLFWAFDTHK QERRTLNFRG 1800

SILSHKVGNL TAHTSYEISA WAKTDLGDSP LAFEHVMTRG VRPPAPSLKA KAINQTAVEC 1860

TWTGPRNVVY GIFYATSFLD LYRNPKSLTT SLHNKTVIVS KDEQYLFLVR VVVPYQGPSS 1920

DYVVVKMIPD SRLPPRHLHV VHTGKTSVVI KWESPYDSPD QDLLYAIAVK DLIRKTDRSY 1980

KVKSRNSTVE YTLNKLEPGG KYHIIVQLGN MSKDSSIKIT TVSLSAPDAL KIITENDHVL 2040 

LFWKSLALKE KHFNESRGYE IHMFDSAMNI TAYLGNTTDN FFKISNLKMG HNYTFTVQAR 2100 

CLFGNQICGE PAILLYDELG SGADASATQA ARSTDVAAVV VPILFLILLS LGVGFAILYT 2160
KHRRLQSSFT AFANSHYSSR LGSAIFSSGD DLGEDDEDAP MITGFSDDVP MVIA 



Seq ID NO: 79 DNA sequence 

Nucleic Acid Accession #: Eos sequence

Coding sequence: 120-755

1 11 21 31 41 51

CCTGCCACTC CCCTGCCAGT GTGCGAGGCA GCGTGAAGCT GGGGCCTGCT CCCCGCAGCC 60 

TCTGGAGCGC ATCTCAGACC TTCTGAGACC TATGTTGCTG GCCCCCCAGA ACCCGCAACA 120 

TGGCAGATGG GGCAAAGGCC AACCCCAAAG GGTTCAAAAA GAAGGTGCTG GATAGATGCT 180 

TCTCTGGGTG GAGGGGCCCA CGCTTCGGGG CCTCCTGTCC TTCAAGAACC TCCAGGTCTA 240 

GCCTGGGTAT GAAGAAGTTC TTCACCGTGG CCATCCTTGC TGGCAGCGTT CTGTCCACAG 300 

CTCACGGCAG CCTGCTCAAC CTGAAGGCCA TGGTGGAGGC CGTCACAGGG AGGAGCGCCA 360 

TCCTGTCCTT CGTGGGCTAC GGTTGCTACT GTGGGCTGGG GGGCCGTGGC CAGCCCAAGG 420 

ATGAGGTGGA CTGGTGCTGC CACGCCCACG ACTGCTGCTA CCAGGAACTC TTTGACCAAG 480 

GCTGTCACCC CTATGTGGAC CACTATGATC ACACCATCGA GAACAACACT GAGATAGTCT 540 

GCAGTGACCT CAACAAGACA GAGTGTGACA AGCAGACATG CATGTGTGAC AAGAACATGG 600 

TTCTGTGCCT CATGAACCAG ACGTACCGAG AGGAGTACCG TGGCTTCCTC AATGTCTACT 660 

GCCAGGGCCC CACGCCCAAC TGCAGCATCT ATGAACCGCC CCCTGAGGAG GTCACCTGCA 720 

GTCACCAATC CCCAGCGCCC CCCGCCCCTC CCTAGAGCCT CTGAGGTTTG AGAGAGAGAG 780 

CGGGAGGAGG GTCTGGCTTG GGGACCAGAC GAGGTGCAGG GAGGGTAGGA GCCAGGCCAG 840 

GAGCCTGAGG GTTGCTGGTT GCCTCCTCCC TGGAGCTCTC CAGTGAGGGC TCAGCTCTCA 900 

GAGGACTCAG GAAGGCCTGG GTCCTGACTC CCCCAGCCCA GCCCCAGGCA TGGGTGCCTC 960 

CTGCTGCTGG TTCTGGACTG GGTGGGAGGC ACGGAGCTTA TAGGGGTCTC TCCTGAGGGT 1020 

GGCCGGGGAG ACCTGAGAGA GAGGAGGAGG GGCCTCTGAG TGGGGCCTCT GTTGCTGGCG 1080 

CCAGTTTAAC TCCCCGGAGC CTTAGAAAGT CTGAGCCTTA GAAAGTCTGG GCCTGAGCAT 1140 

CCAGGCCCAG AGCTGGATGC ATCCTCGGCC CAAGATCACA GGAAGGCAGA TTGCTGGTCA 1200 

CAAAGCATGG GTTCCGGGAG CCCCTCAGCT GATCCCACAG GATGGCCTGG GGTGGTGGCT 1260 

ACTTTGGGCT TGAAGCTCTC TAGAGCCCCA TTCACAGAGC AGGTGCCCCA CCCCAGCCCT 1320 

CACTGAGGTG GCCATGAGTC CAGGTTCAGA CCCTAGGGGA TGTGCAGTGC TGAGTGTCCC 1380 

AGCCCTGAAG TCTAGCCTCA GAGCAGGCTC CTGGGCACTG ACAGAGCCAC ACACCCAGGT 1440 

ACACCCCCCA GGCTAATATG GGGACACACA AAAATATGGC CTTCAACACC TACAAGTCTT 1500 

CTCTGTGTCC CCCATCGACC CATGCACATC ACCCCAACCC TGCTCTGCCA CACCACGTGG 1560 

GTCTGAGGAG CACTCCTACC CCCCGGCTCC GGGTCCCTGA CAGACACTGC CCTCCTGGCC 1620 

TGCACCTGGA ATGGCAGCCA GAACACTGGA GGGGGACCCA CACCCTCTCT CCCTGGGCTG 1680 

TGGATGGGTG TTGCAAGTCC CATCCCTCCT GGCCTGCACC TGGAACAGCA GCCAGAACAC 1740

CGGAGGGGGA CCCACACCCT CTCTCCCTGG GCTGTGGGCA GGCGTCACAA GTCCCATTGG 1800

TGGGGAAGAG GCTGAGGGCT GAGACCCACC TCAGTGAAGA GGGAAACGTA AGCAGGGAGA 1860

GCATCAAGGC GGGAGGCAGC CTGGAAACTT CTAGAAGAGG GTCAAGGTAG GAGGCTACAT 1920

GGAAGCTTCT AGAAGGCAGT AAGGAACAGG GTGGTGAAGG AGAGGGGAGA AGATGGGAGC 1980

ATGGAGAGAC GAGGTGAGTG ACAGCCACCA GCAGGGACGG GCCTCCAGGC AACACAGAGG 2040 

AGCTGGGGGC CAGGGCCTTA CGGGGCTGGG TTCCTGTGTG GGGCGTGGAG CTGGGGCTGA 2100 

AGTGGAGGGG ACGGCCCTGC GGCCCCCAGC ACTGTTGTTT CTCCAGGTGC ACCTTACCTT 2160 

CCTCCTTACC TGCCCGAGCC TCAACTCAGG GTCAGCAGCC TCCACGGAGC CAGCCCCACC 2220

TGCCCGAAAA CCACTGAAGC CACCCTCCGC CCGACGTCAA CGGGGATCCA CTAGTTTAGA 2280 
GCGCGGCCCC CGCGTGC 

Seq ID NO: 80 Protein sequence: 
Protein Accession #: Eos sequence 
MADGAKANPK GFKKKVLDRC FSGWRGPRFG ASCPSRTSRS SLGMKKFFTV AILAGSVLST 60
AHGSLLNLKA MVEAVTGRSA ILSFVGYGCY CGLGGRGQPK DEVDWCCHAH DCCYQELFDQ 120
GCHPYVDHYD HTIENNTEIV CSDLNKTECD KQTCMCDKNM VLCLMNQTYR EEYRGFLNVY 180
CQGPTPNCSI YEPPPEEVTC SHQSPAPPAP P 

Seq ID NO: 81 DNA sequence 
Nucleic Acid Accession #: NM_018136.1
Coding sequence: 38..2218
GAAATCTGTA CGAACTATTC AGGCTGCTTT TAGAGGCATG AAAGTTAGAC AAAAATTGAA 60

AAATGTATCA GAGGAAAAGA TGGCAGCCAT TGTTAACCAA TCTGCACTCT GCTGTTACAG 120

AAGCAAAACT CAGTATGAAG CTGTTCAAAG TGAAGGTGTT ATGATTCAAG AGTGGTATAA 180 

AGCTTCTGGC CTTGCTTGTT CACAGGAAGC AGAGTATCAT TCTCAAAGTA GGGCTGCAGT 240

AACAATTCAA AAAGCTTTTT GTAGAATGGT CACAAGAAAA CTGGAAACAC AGAAATGTGC 300 

TGCCCTACGG ATTCAGTTCT TCCTTCAGAT GGCTGTGTAT CGGAGAAGAT TTGTTCAGCA 360 

GAAAAGAGCT GCTATCACTT TACAGCATTA TTTTAGGACG TGGCAAACCA GAAAACAGTT 420 

TTTACTATAT AGAAAAGCAG CAGTGGTTTT ACAAAATCAC TACAGAGCAT TTCTGTCTGC 480 

AAAACATCAA AGACAAGTCT ATTTACAGAT CAGAAGCAGT GTTATCATTA TTCAAGCTAG 540 

AAGTAAAGGA TTTATACAGA AACGGAAGTT TCAGGAAATT AAAAATAGCA CCATAAAAAT 600 

TCAGGCTATG TGGAGGAGAT ATAGAGCCAA GAAATATTTA TGTAAAGTGA AAGCTGCCTG 660 

CAAGATTCAA GCCTGGTATA GATGTTGGAG AGCACACAAA GAATATCTAG CTGTATTAAA 720 

AGCTGTTAAA ATTATTCAAG GTTGCTTCTA TACCAAACTA GAGAGAACAC GGTTTTTGAA 780 

TGTGAGAGCA TCAGCAATTA TCATTCAGAG AAAATGGAGA GCTATACTTC CTGCAAAGAT 840

AGCTCATGAA CACTTCTTAA TGATAAAAAG ACATCGAGCT GCTTGTTTGA TCCAAGCACA 900 

TTATAGAGGA TATAAAGGAA GGCAGGTCTC TCTTCGGCAG AAATCTGCTG CTTTGATCAT 960 

ACAAAAATAT ATACGAGCCA GGGAGGCTGG AAAGCATGAA AGGATAAAAT ATATTGAATT 1020 

TAAAAAATCT ACAGTTATCC TACAAGCACT GGTGCGTGGT TGGCTAGTAC GAAAAAGATT 1080 

TTTAGAACAG AGAGCCAAAA TTCGACTTCT TCACTTCACT GCAGCTGCAT ATTATCACCT 1140 

GAATGCTGTT AGAATTCAAA GAGCCTATAA ACTTTACCTG GCTGTGAAGA ATGCTAACAA 1200 

GCAGGTTAAT TCAGTCATCT GTATTCAGAG ATGGTTTCGA GCAAGATTAC AAGAAAAGAG 1260 

ATTTATTCAG AAATATCATA GCATCAAAAA GATTGAGCAT GAAGGTCAAG AATGTCTGAG 1320 

CCAGCGAAAT AGGGCTGCAT CAGTAATACA GAAAGCAGTG CGCCATTTTC TCCTCCGTAA 1380 

AAAGCAGGAA AAATTCACTA GTGGAATCAT TAAAATTCAG GCATTATGGA GAGGCTATTC 1440 

TTGGAGGAAG AAAAATGATT GTACAAAAAT TAAAGCTATA CGACTAAGTC TTCAAGTTGT 1500 

TAATAGGGAG ATTCGAGAAG AAAACAAACT CTACAAAAGA ACTGCACTTG CACTTCATTA 1560 

CCTTTTGACA TATAAGCACC TTTCTGCCAT TCTTGAGGCC TTAAAACACC TAGAGGTAGT 1620 

TACTAGATTG TCTCCACTTT GTTGTGAGAA CATGGCCCAG AGTGGAGCAA TTTCTAAAAT 1680 

ATTTGTTTTG ATCCGAAGTT GTAATCGCAG TATTCCTTGT ATGGAAGTCA TCAGATATGC 1740 

TGTGCAAGTC TTGCTTAATG TATCTAAGTA TGAGAAAACT ACTTCAGCAG TTTATGATGT 1800 

AGAAAATTGT ATAGATATAC TATTGGAGCT TTTGCAGATA TACCGAGAAA AGCCTGGTAA 1860 

TAAAGTTGCA GACAAAGGCG GAAGCATTTT TACAAAAACT TGTTGTTTGT TGGCTATTTT 1920 

ACTGAAGACA ACAAATAGAG CCTCTGATGT ACGAAGTAGG TCCAAAGTTG TTGACCGTAT 1980 

TTACAGTCTC TACAAACTTA CAGCTCATAA ACATAAAATG AATACTGAAA GAATACTTTA 2040 

CAAGCAAAAG AAGAATTCTT CTATAAGCAT TCCTTTTATC CCAGAAACAC CTGTAAGGAC 2100 

CAGAATAGTT TCAAGACTTA AGCCAGATTG GGTTTTGAGA AGAGATAACA TGGAAGAAAT 2160

CACAAATCCC CTGCAAGCTA TTCAAATGGT GATGGATACG CTTGGCATTC CTTATTAGTA 2220 

AATGTAAACA TTTTCAGTAT GTATAGTGTA AAGAAATATT AAAGCCAATC ATGAGTACGT 2280 

AAAGTGATTT TTGCTCTCTG TGTACAACTT TTAAAATCTG ACTTTGTTTT AAAAAAACAT 2340 

AAACTGTTCA TTACATTCTT CATTTTTATC ATTTATAGTT TTATGCATGT AATAAACTAA 2400 
TATGTCATAA GATG 

Seq ID NO: 82 Protein Sequence 
Protein Accession #: NP_060606.1
MKVRQKLKNV SEEKMAAIVN QSALCCYRSK TQYEAVQSEG VMIQEWYKAS GLACSQEAEY 60

HSQSRAAVTI QKAFCRMVTR KLETQKCAAL RIQFFLQMAV YRRRFVQQKR AAITLQHYFR 120 

TWQTRKQFLL YRKAAVVLQN HYRAFLSAKH QRQVYLQIRS SVIIIQARSK GFIQKRKFQE 180

IKNSTIKIQA MWRRYRAKKY LCKVKAACKI QAWYRCWRAH KEYLAVLKAV KIIQGCFYTK 240 

LERTRFLNVR ASAIIIQRKW RAILPAKIAH EHFLMIKRHR AACLIQAHYR GYKGRQVSLR 300 

QKSAALIIQK YIRAREAGKH ERIKYIEFKK STVILQALVR GWLVRKRFLE QRAKIRLLHF 360 

TAAAYYHLNA VRIQRAYKLY LAVKNANKQV NSVICIQRWF RARLQEKRFI QKYHSIKKIE 420 

HEGQECLSQR NRAASVIQKA VRHFLLRKKQ EKFTSGIIKI QALWRGYSWR KKNDCTKIKA 480

IRLSLQVVNR EIREENKLYK RTALALHYLL TYKHLSAILE ALKHLEVVTR LSPLCCENMA 540

QSGAISKIFV LIRSCNRSIP CMEVIRYAVQ VLLNVSKYEK TTSAVYDVEN CIDILLELLQ 600 

IYREKPGNKV ADKGGSIFTK TCCLLAILLK TTNRASDVRS RSKVVDRIYS LYKLTAHKHK 660

MNTERILYKQ KKNSSISIPF IPETPVRTRI VSRLKPDWVL RRDNMEEITN PLQAIQMVMD 720 
TLGIPY 

Seq ID NO: 83 DNA sequence

TTCTCCCGCA ACCTTCCCTT CGCTCCCTCC CGTCCCCCCC AGCTCCTAGC CTCCGACTCC 60

CTCCCCCCCT CACGCCCGCC CTCTCGCCTT CGCCGAACCA AAGTGGATTA ATTACACGCT 120 

TTCTGTTTCT CTCCGTGCTG TTCTCTCCCG CTGTGCGCCT GCCCGCCTCT CGCTGTCCTC 180 

TCTCCCCCTC GCCCTCTCTT CGGCCCCCCC CTTTCACGTT CACTCTGTCT CTCCCACTAT 240 

CTCTGCCCCC CTCTATCCTT GATACAACAG CTGACCTCAT TTCCCGATAC CTTTTCCCCC 300 

CCGAAAAGTA CAACATCTGG CCCGCCCCAG CCCGAAGACA GCCCGTCCTC CCTGGACAAT 360 

CAGACGAATT CTCCCCCCCC CCCCAAAAAA AAAAGCCATC CCCCCGCTCT GCCCCGTCGC 420 

ACATTCGGCC CCCGCGACTC GGCCAGAGCG GCGCTGGCAG AGGAGTGTCC GGCAGGAGGG 480 

CCAACGCCCG CTGTTCGGTT TGCGACACGC AGCAGGGAGG TGGGCGGCAG CGTCGCCGGC 540 

TTCCAGACAC CAATGGGAAT CCCAATGGGG AAGTCGATGC TGGTGCTTCT CACCTTCTTG 600 

GCCTTCGCCT CGTGCTGCAT TGCTGCTTAC CGCCCCAGTG AGACCCTGTG CGGCGGGGAG 660 

CTGGTGGACA CCCTCCAGTT CGTCTGTGGG GACCGCGGCT TCTACTTCAG CAGGCCCGCA 720 

AGCCGTGTGA GCCGTCGCAG CCGTGGCATC GTTGAGGAGT GCTGTTTCCG CAGCTGTGAC 780 

CTGGCCCTCC TGGAGACGTA CTGTGCTACC CCCGCCAAGT CCGAGAGGGA CGTGTCGACC 840 

CCTCCGACCG TGCTTCCGGA CAACTTCCCC AGATACCCCG TGGGCAAGTT CTTCCAATAT 900 

GACACCTGGA AGCAGTCCAC CCAGCGCCTG CGCAGGGGCC TGCCTGCCCT CCTGCGTGCC 960 

CGCCGGGGTC ACGTGCTCGC CAAGGAGCTC GAGGCGTTCA GGGAGGCCAA ACGTCACCGT 1020 

CCCCTGATTG CTCTACCCAC CCAAGACCCC GCCCACGGGG GCGCCCCCCC AGAGATGGCC 1080 

AGCAATCGGA AGTGAGCAAA ACTGCCGCAA GTCTGCAGCC CGGCGCCACC ATCCTGCAGC 1140 

CTCCTCCTGA CCACGGACGT TTCCATCAGG TTCCATCCCG AAAATCTCTC GGTTCCACGT 1200 

CCCCCTGGGG CTTCTCCTGA CCCAGTCCCC GTGCCCCGCC TCCCCGAAAC AGGCTACTCT 1260 

CCTCGGCCCC CTCCATCGGG CTGAGGAAGC ACAGCAGCAT CTTCAAACAT GTACAAAATC 1320 
GATTGGCTTT AAACACCCTT CACATACCCT CCCCCC 

Seq ID NO: 84 Protein Sequence 
Protein Accession #: NP_000603.1

1 11 21 31 41 51 

1 | | I I I 

MGIPMGKSML VLLTFLAFAS CCIAAYRPSE TLCGGELVDT LQFVCGDRGF YFSRPASRVS 60 

RRSRGIVEEC CFRSCDLALL ETYCATPAKS ERDVSTPPTV LPDNFPRYPV GKFFQYDTWK 120 

QSTQRLRRGL PALLRARRGH VLAKELEAFR EAKRHRPLIA LPTQDPAHGG APPEMASNRK 180 



Seq ID NO: 85 DNA sequence 

Nucleic Acid Accession #: XM_092643.1

Coding sequence: 1-5352 
ATGACTGAAG CCATTACAGA AGCAGCAGTA GCCTCAAGTG AGGAGGTGTC AGAGGAAAGA 60

GACGACCTGG GGCCTCTTGA ATTACATGAC AGTGGTACCT TTCAACAAGT CGTGAACCTT 120 

CTGGACATCA TTGACAGCGA GTCAGCAAAG ACGGACACAA CAGGGGCAGG CCTTGACATG 180 

CGGAAGACCC TGGCCTCGGT GATAATCATG GAGAAGGCCA CCACTGAGCC TTCTGTAGTG 240 

ATAAACACTC TCATCCGCTG CCTGCAGGTG CCAGAGATTT CCACCCAGCG CAAGGTCAAC 300 

ATTTACAACA TCCTCCAGGA CATCATCCAG CAGGAGGGGG AGCTGGAGGA GCAGTGCGTG 360 

CAGAGGCTGG TGGCCATTGC CTCCAAGGAG ATGAGGGAGA TCCCAGAGAT GGAGGGCTAT 420 

ATGAAGGCAG AGGTGGCCAG CGACACACTG GTGGCTCTGT CCCGAAACCA CTTCAGCTTG 480 

GTCATGTACG AGCTGCAGCA CCACCTCAAG CCCCTCAACC TCACTGATGA ATTTGTCATC 540 

ATCACACTGG CCAAGCTGGC CAACGGCAAT GTGTTTGAGT TCATGCCATA CATGGGCATC 600 

ACCCTGGCTA CCATATTCAC CATGCTGAGA CTTGCCAATG AAGCCAAGAT ACGCCAGGCG 660 

ATCTGCAGTG CCATGGAGAC CTTCTGTGAG ACGGTGCAGT TTTATCTGAA GCACCTGGAG 720 

GAGAGCGTGT ACCCCGTGAT GACTGAGGAG GAGTTTGCCC TGAAGGTGTT CCCCATGTAT 780 

CGCTACTTCG TGACAGTGTG GCTGAGGCAC TACAACCCCG AGGTGAAGCT GGGGGTGATC 840 

AAGTCCCTGA AGCCCATGCT CGGCCTCCTT CTGCCCAACG ATGACCTGCG GGAGCAGGTC 900 

TACGACTACA TCCCCCTGCT GCTGGCGGAG TACCAGGGCA GTCTGGAGGT CTTGAGGCAG 960 

ATCCTGGAAC TGTCAGTCAC CACCAACACC CCTGTCCCCC AAATGCAGCT ACACACCATT 1020 

TTCACAGAAC TGCACGTCCA GGTGTGCAAC AAGGCCCCGG CCCAGCATCA GTACAGCAGC 1080 

CAGAATCTGA TGGAGATGGT GCACTGCTTC GTAGCCCTTG CTCGCTCCTA CCCCAAGGAG 1140 

CTGATGAAGT TCTTCTTCAG CCAGATGGAG ACAAACAAGG AGGCCGTCCG CGTGGGGACT 1200 

CTGAATCTGA TTAGGGCTAT AGTGAGCGCA GATGAGCCCA GGATGAGTAT CAGGGCCATC 1260 

TACCTGGCTA TCCGGGTAGT CAAGAACACC ATCTCTGATA CCCGGTCCAA GGTGAGGATG 1320 

GCTATTCTCC ACATCATTGG GCAGTTGGCT CTCTGTGGCT ACCAGGAGAG AATCAAAGGC 1380

TGGGGCCTGA AGTACCTGTC TGTGCAGCTG ACCTTATCCA CCTACAAACT GACAAATCGC 1440 

CGGGAGAAGT TTTATCAGAG GGACTTGGAG GAGAGGATGG TCCACAAAGT CACCATGGAC 1500 

ACTGTGAAGA TCATTACCTC TTCTGTCAGT GGGATGACCA CCGAGTTTTG GGTGAGGCTG 1560 

CTGTGCTACA TCATGGAGAC AGACTACGTG GAAGCTTTGA CTCCTATCTG TATCAGCCTC 1620 

ACAAACCTGG CAGAACACCA GCTCCATGGC CAGGATGTGG ATGTCAGCGT GGCTGGCAAG 1680 

AGCAGGCAAG TGGACCTGCC TGCACCTCAG AAGCTGCTGG CCCGTCTCCT GGTGCTGATG 1740 

TCATCACCTT ACAAGGGGGA GGGTCGTGGG ATAGCCATGC TCAACCTCTT GAGGACCCTG 1800

AGCCAGAGCA TCGCACCCTC CATGGCCGAC ATGTGGGAGC TGGAGATTGC GCTACTGGTC 1860 

CGGTACCTGG AAGAACATAC TGAGTTCACT TGGGATCAGA AAGCCTGGGA AGACAAGCTG 1920 

ATTCAGTTTC TGCGAAACTC CCTCAAGAAG ACCCGGGGGT CTAGCTGGAG CCTGCGCTTG 1980 

AGTAAAGAGC TGAACAACCA GATTGCGAGC TTTGACAGCC CCTCTCTGGA GAAGGGCTTT 2040 

CTGTACCGGG CCTTGGGCTT CACCTTGGCC ACAGGCCTGG AGGCCAGCAA GGTGGAGGTC 2100 

CTGCTGTTGG AGCTGCTGTA CAAGACGGAC TACAGCAATG ACTTTGACAG CGAGGGTGTG 2160 
ATTATGTGCT TTGGCCTGTG TGCCCGGGGC CAGGTAAAAA CGGTGCTGAA TGTGCTTCAT 2220

GACTTCGAGG AGAGGATCCA GGAGTCAGAG CAGTCCTGGC AGATCAGTGC TTGGCGGAAG 2280 

GACCATCCCT GGAGGCGGGA GACAGTGAAA AGTGCCCTCA TGGTGATGTA TAGCTGCGTG 2340

GCCTCCTACT GCCACCCCCA GTTGCTCCTC AACCTCGTGG ACAGCCCCAT CACCGCTAAG 2400 

ATCATTCACC ATTATGTCAG CAGCTGCCAG GACATCTGTC TCAAAATGGC CTTCATGAAG 2460 

AGTGTTGTGC AGGTTACCAA GGCCATCAAC AACATCAAGG ACCTGGAGGA CTTTCACTTT 2520 

GCCCAGAAGA CGACTCTTAC CAGCATTATA GTGGCGGTCA TCAAGGCAGA ACCGACTGAC 2580 

AACCTGGTTT CTCCAGTGCG AGCCTTGGCG ATGGAGGCCC TCTCGCACCT GAGCAAGCTG 2640 

AAGCCTTTCT ACTCCACAGA GGAAAACAGT GAGCTGATGG ATATCAGCAT ACATTCTGTA 2700 

ATTTCTCTCC AACTCCCAGG AGAGGACAAT GAGTCCATTA AGACCCTGTA TGCAAATGCC 2760 

CTGAGCTCCC TGGAGCAGCT GATGGAGAGC CTCCTGCAGA GGCAGCTGGA CCCCAAGGGG 2820 

CTGCAGGAGA TGGTGCAGCT CCTGGAAAAG TGGATCTTGT CGGAGAAAGA ATGGGAGCGG 2880 

GAAAAGGCCG TGAGCCTCCA TCTCTATCTC ATGTGGATTT ATGTCCACAG CACTGCTGTC 2940 

TGTATCCACC TAAAGCTGGG GCAGTTTGGC ACAATGGTCG GACTCATTGC CCCGTGCACC 3000 

TGTGATGCCC ATCAAAGAAC CCGCATGGCC TCAATGAATG TCCTGTCCAG CCTGCTAGAT 3060 

CTTCACGCAA GCCAGACCTG CTCCTTGTGG GGCCCTTCCA AGCAGAAGGA GCTTGAGAAA 3120 

TGTAAGGGGG ACCTCCAGAG CACAGATGTG GAGAAGATCT TCTGTGCATC CTCCAGAATC 3180

GCCAAGGTGG TCTGCATGGA GTTTAGCTGC GATGAGGTGG TCTCGCTCAT CCAGAAGCTC 3240 

TGCGAGAACA CTGGGGCCAT GAACCTGCAG CATGACAAGG CCTCTGTCAC CTGGATAGCC 3300 

TTCTTCCTCC AGATGCGGGC CAAGGAGCTG GAGGACAAGG TGGCCGAGAT CCTGAGTGCC 3360 

ATCCTGGTGC ACCTGCCGGT GGTGGACCAC CCAGAGGTGC GGCGCCTTCT CATTGACGGC 3420 

ATCCTGCTGC TGGCGCACCA CCACCAGGAG ACCATCCTCA CATCGCTCCT GAGGCAGCCA 3480

CTGCCCATGG AGAGCCACCT GGCAGAGGTG TGGCTGGCAG TGTCGGAGAA CGTGCCCTTC 3540 

GCCCGGACCA TGCTCCACAG CCTGATGGGC CGGCTGCAGT CACGGCTCAG CCCCAGAATC 3600 

AGTGCCACCT CCAAGGCTGA CATCTGGCGC CTGGCTGCGG TGGACCCCCT GATGACCCTG 3660

TGCACCATCC ACCTTCTCAT TCAGAAGCTG GATGAGAATG ACAAGCTCCC GGACTTCCTC 3720 

CCTGACCTCA TCTACACCCT CCTGCTGCAG CTTGGAAGCA GCCACCGACC AGAGGCCGCC 3780 

CCGCCGGTCT TGAAGATGTG GAAGCTGGTC CACACCACTC CTCTGCCGGA GGAGATGAAC 3840 

CTGCAAAGGT GCTCTCGAGG GCGGGTCACT ATCAAGTCCA TGCAGCTCTT GTTCAAGAGA 3900

GTCAAGAGCC AGCACCTGGC ACATACCCTG GACGAGCAGG CAGTGTGGGA CCTCCTGCAG 3960 

GACGGCGGGA CATTCCTGGA GGGTGTGAGC CTGCTGGCCA GGCTGTGCAT GCAGCACGTG 4020 

GAGGGCCACA GGCAGAGGCT GGCCGAGCTG GTGCTCAGGG GCATGGACTC AGAAGTCCTG 4080 

AGCTGCCGCA TCAGCAGCAC AGCGGTCTGC GTGGAAATGA GGCACCGGTT CATGAGCGGC 4140 

CCAGTTCTGT ACCAGGAGAA GCTGCTGAAG CCGGCAGCTT TGCTGCTGGA GAAGGGTGCC 4200 

GACCAGGAGG AAGACGAGGC CCTGCGGGTG CTGTCCCTGC GCGCCCTCGG CAACATGGCC 4260 

CTGGGCGCCC CCAAGAAGGT GAAGCAGTAC CGGAAGGTCT TGCTGGAGAA GTGCCTGGGC 4320 

CCCCTGAGGG AGCCCGTGAG CAACAGCGTG ACTGCCGAGG GCATGGAGGC CCTGACCAAG 4380 

ATCCTGGCTG AGCTCCGGGA AGGGGATGTG GGGTCCTCTT TCGACGCCAT GTCTGAGCAG 4440 

TGCAGGATCT TCTTCGACAA CGAGAGCGAG CTGCTGCGTC TGAAAGCCTT CATCCTCTTT 4500 

GGAAAGCTGG CAAGGGTGGT CGGGATGTCC AAGAAGCATT TCTTCAAAGG GGAGGTGAAG 4560 

AAGGCCTGGA TCCCCCTCAT GCTGCACTCC CAGGACCCCT GCTCCAATGC AGCCCAAGTA 4620 

AGATACATCC TGGGCTTTGT GTCCCAGTCT GGGGCCCGCT GTTCCCCGAG GAAACAGGTC 4680 

CTTGGGTCTG CTGGCTCGGC CTGTATGGCT ACCATGTTTC AGTGTGTGCA CTTCTGGGGC 4740 

TGGAAGTCCC TGGAGCATCC CTCAGGGCCA AGTGATACCG CTACTGATGA CAAGATGACC 4800 

GTTTTCCAGA CAACCATGTG CTCCATCCTG ACTCGGAAAA AGCCGGCTGT TCTCTACCGC 4860 

TTCTTGCTAG AAACAATGGC CTATGTTAAA AATAACTTGT CAAGAATCAG AATCGCTGCT 4920

TGCAACTTGG CAGGAATTAT TATGAAGCAG ATGTCTACAC ATTATCTGAA AAAGCTGGAC 4980 

TTCCCAGCAT TACGGAATTC CCTCCAGGAA CTACAGCTGG ACCCGGATCC CGGGCCCACT 5040 

GGCCCAAAGG GTCGGAAACT AGCTTTGTGT CTTCACTCGG AGTTCATCCA CACGCATGCT 5100 

GACAGCGTGC CTAAGGCCAG GCACTGTGCC AAGCGAGCAG GTAAACCAGG ACTTCCAAAC 5160 

TGTTCCCAGT CCAACTGCCA AAGGAATTCA CCATCCGGGG TCCCATCAAC AGGAGAGAGC 5220 

TGCTATGCTG TCCAAAATCC AACCCGTGCT AAAGCCTTCT CTGGAACAGT GGTCTTCAAA 5280 

CTGTTTTGTG GAGCAAATGG TGACTTGATT TCCTGCCCAG TGATAAAGAG TATTGCTGCT 5340 
AAACAAGCTT GA 

Seq ID NO: 86 Protein sequence: 
MTEAITEAAV ASSEEVSEER DDLGPLELHD SGTFQQVVNL LDIIDSESAK TDTTGAGLDM 60

RKTLASVIIM EKATTEPSVV INTLIRCLQV PEISTQRKVN IYNILQDIIQ QEGELEEQCV 120

QRLVAIASKE MREIPEMEGY MKAEVASDTL VALSRNHFSL VMYELQHHLK PLNLTDEFVI 180

ITLAKLANGN VFEFMPYMGI TLATIFTMLR LANEAKIRQA ICSAMETFCE TVQFYLKHLE 240

ESVYPVMTEE EFALKVFPMY RYFVTVWLRH YNPEVKLGVI KSLKPMLGLL LPNDDLREQV 300

YDYIPLLLAE YQGSLEVLRQ ILELSVTTNT PVPQMQLHTI FTELHVQVCN KAPAQHQYSS 360

QNLMEMVHCF VALARSYPKE LMKFFFSQME TNKEAVRVGT LNLIRAIVSA DEPRMSIRAI 420

YLAIRVVKNT ISDTRSKVRM AILHIIGQLA LCGYQERIKG WGLKYLSVQL TLSTYKLTNR 480

REKFYQRDLE ERMVHKVTMD TVKIITSSVS GMTTEFWVRL LCYIMETDYV EALTPICISL 540

TNLAEHQLHG QDVDVSVAGK SRQVDLPAPQ KLLARLLVLM SSPYKGEGRG IAMLNLLRTL 600

SQSIAPSMAD MWELEIALLV RYLEEHTEFT WDQKAWEDKL IQFLRNSLKK TRGSSWSLRL 660

SKELNNQIAS FDSPSLEKGF LYRALGFTLA TGLEASKVEV LLLELLYKTD YSNDFDSEGV 720

IMCFGLCARG QVKTVLNVLH DFEERIQESE QSWQISAWRK DHPWRRETVK SALMVMYSCV 780

ASYCHPQLLL NLVDSPITAK IIHHYVSSCQ DICLKMAFMK SVVQVTKAIN NIKDLEDFHF 840

AQKTTLTSII VAVIKAEPTD NLVSPVRALA MEALSHLSKL KPFYSTEENS ELMDISIHSV 900

ISLQLPGEDN ESIKTLYANA LSSLEQLMES LLQRQLDPKG LQEMVQLLEK WILSEKEWER 960

EKAVSLHLYL MWIYVHSTAV CIHLKLGQFG TMVGLIAPCT CDAHQRTRMA SMNVLSSLLD 1020

LHASQTCSLW GPSKQKELEK CKGDLQSTDV EKIFCASSRI AKVVCMEFSC DEVVSLIQKL 1080

CENTGAMNLQ HDKASVTWIA FFLQMRAKEL EDKVAEILSA ILVHLPVVDH PEVRRLLIDG 1140

ILLLAHHHQE TILTSLLRQP LPMESHLAEV WLAVSENVPF ARTMLHSLMG RLQSRLSPRI 1200

SATSKADIWR LAAVDPLMTL CTIHLLIQKL DENDKLPDFL PDLIYTLLLQ LGSSHRPEAA 1260

PPVLKMWKLV HTTPLPEEMN LQRCSRGRVT IKSMQLLFKR VKSQHLAHTL DEQAVWDLLQ 1320

DGGTFLEGVS LLARLCMQHV EGHRQRLAEL VLRGMDSEVL SCRISSTAVC VEMRHRFMSG 1380

PVLYQEKLLK PAALLLEKGA DQEEDEALRV LSLRALGNMA LGAPKKVKQY RKVLLEKCLG 1440

PLREPVSNSV TAEGMEALTK ILAELREGDV GSSFDAMSEQ CRIFFDNESE LLRLKAFILF 1500

GKLARVVGMS KKHFFKGEVK KAWIPLMLHS QDPCSNAAQV RYILGFVSQS GARCSPRKQV 1560

LGSAGSACMA TMFQCVHFWG WKSLEHPSGP SDTATDDKMT VFQTTMCSIL TRKKPAVLYR 1620

FLLETMAYVK NNLSRIRIAA CNLAGIIMKQ MSTHYLKKLD FPALRNSLQE LQLDPDPGPT 1680

GPKGRKLALC LHSEFIHTHA DSVPKARHCA KRAGKPGLPN CSQSNCQRNS PSGVPSTGES 1740

CYAVQNPTRA KAFSGTVVFK LFCGANGDLI SCPVIKSIAA KQA



Seq ID NO: 87 DNA sequence 

Nucleic Acid Accession #: CAT cluster

1 11 21 31 41 51

I I I I I I

CACTTCTCCC TCTCCTTAGT TCTCCGGTAA CCAAACCTTC TCTCGTTATA TACCCAATGA 60
TAAAACCAAA GTAATGTTGT ACAGGTTAAA GATTCAAAAC AAACCATAGC AACTGAAACA 120
GGGTATCAGG ACGCAAATGC TTGGCTGGAA TGGATTAAAC ATTCTGTTGC AGGCTAAACA 180
AAAGCGCCTG TTACACTTGT GTGACAGGCA GGCCAGAGAC CCAAATTGTT CCCTTTCCAC 240
TTGGGTGGTC CTCTCACAGA CCGGGCATGA GCTGTGTGGT AGCTCTCTTC CAGAACCCCA 300
CAGCCTGGGG TGGTGAGTCA TGCAAGACTC TTTCACTGCT GTTCCCTGAG GTCAAGAGCC 360
CTGCAGGTCA GCCCTGAGGA CTATCCGGCC TCCAGCCCCT GATGTTAACT TCACCTCATG 420
CCTTTCACGG CAGGGGGAAA AGTTAGCATT CCTTGGAGAC TTAACAGGGT GCAGTGAAAC 480
CAGGCCTTTT GAAGAGCTTA CCAATCAGTC TGCCCTTGTT CATCCCCGAG CAGATGTGTG 540
GTGGTATTGC TGGGGACTAC TGCTGGGTAC TCTGCCAAGT AATTAGAGCA GCACTCATGC 600
TCTAGTCCAA CTGGCCATCC CTTTCACCCT AGCATTCTGT CAATATGATA AAAAAAAATG 660
TAAAGAAGTG CCTCACGTGG GTCCTTGACC CCCATGTTTA TATAAACACT ATTGGGTTCT 720
ATGAGGAGAA CCCGATGA



Seq ID NO: 88 DNA sequence 

Nucleic Acid Accession #: FGENESH predicted 
Coding sequence: 1..759

1 11 21 31 41 51

I I I I I I

ATGCAGGAAG AAACAGGCTG GCAGCTTGGG GAAAGGATTC TTACATACCA CGGCGACCAG 60
GTAACTCTGT GCACAGACCC AGAAGAAAGT AGAAAAATAA GTTTTGGAGG AAACCTCATT 120
GTGAACACAC CTCACCAGGT CAGAACTATC CTAAGTCAGA AAGCAGAAAG GTGGCTTACT 180
GACTCCAGAA TCTTAAAGTA TGAGGCTATT CTCTTAGAAA AAGATGATTT AACATTAACC 240
ACTGATAATT CACTTAACCC AGCAGGTTTC CTAACAGGGG ATCTATATCA AAAGAGAGAG 300
CACACATGTT TAGTTTTAAT TAATTACCAT ACAAAGGTCC GACCAGACCT AGGAGAAACT 360
CCCTTCAGGA CGGGACAACA CTTTATAGAT GGTTCCTCCC AGGTGATAGA GGGAAAAAGA 420
TACAATGGGT ATTCAGTAAT TGATAGAGAA ACTCTTGTAG AAATAGAGTT AGGAAAATTG 480
CCTAATAATT GGTCTGCTCA AACATGTGAG CTGTTTGCAC TCAGCCAAGC CTTAAAGGAC 540
TTACAGAACC AGGAAGGAAC CATCTATACA GATTCTAAGT ACGCCTTTGG AGTGGCTCAT 600
ACATTTGGAA AAATTTGGAC TGAACGAGGT CTTATTAATA GCAAATGCCA AGACTTGGTC 660
CATAAGGAAT TAATCACCCA AGTATTAACT AACCTTCAGC TGCCAAAAGA AACAGCTATT 720
GACCATGTCC CCGGACACCA AAAAAGCCTT TCTTTTTAA



Seq ID NO: 89 Protein Sequence 
Protein Accession #: FGENESH predicted

1 11 21 31 41 51

I I I I I I

MQEETGWQLG ERILTYHGDQ VTLCTDPEES RKISFGGNLI VNTPHQVRTI LSQKAERWLT 60
DSRILKYEAI LLEKDDLTLT TDNSLNPAGF LTGDLYQKRE HTCLVLINYH TKVRPDLGET 120
PFRTGQHFID GSSQVIEGKR YNGYSVIDRE TLVEIELGKL PNNWSAQTCE LFALSQALKD 180
LQNQEGTIYT DSKYAFGVAH TFGKIWTERG LINSKCQDLV HKELITQVLT NLQLPKETAI 240
DHVPGHQKSL



Seq ID NO: 90 DNA sequence 

Nucleic Acid Accession #: NM_000756.1

Coding sequence: 186..776

1 11 21 31 41 51

I I I I I I

AGAAACTCAG AGACCAAGTC CATTGAGAGA CTGAGGGGAA AGAGAGGAGA GAAAGAAAAA 60

GAGAGTGGGA ACAGTAAAGA GAAAGGAAGA CAACCTCCAG AGAAAGCCCC CGGAGACGTC 120

TCTCTGCAGA GAGGCGGCAG CACCCGGCTC ACCTGCGAAG CGCCTGGGAA GCGAGTGCCC 180

CTAACATGCG GCTGCCGCTG CTTGTGTCCG CGGGAGTCCT GCTGGTGGCT CTCCTGCCCT 240

GCCCGCCATG CAGGGCGCTC CTGAGCCGCG GGCCGGTCCC GGGAGCTCGG CAGGCGCCGC 300

AGCACCCTCA GCCCTTGGAT TTCTTCCAGC CGCCGCCGCA GTCCGAGCAG CCCCAGCAGC 360

CGCAGGCTCG GCCGGTCCTG CTCCGCATGG GAGAGGAGTA CTTCCTCCGC CTGGGGAACC 420

TCAACAAGAG CCCGGCCGCT CCCCTTTCGC CCGCCTCCTC GCTCCTCGCC GGAGGCAGCG 480

GCAGCCGCCC TTCGCCGGAA CAGGCGACCG CCAACTTTTT CCGCGTGTTG CTGCAGCAGC 540

TGCTGCTGCC TCGGCGCTCG CTCGACAGCC CCGCGGCTCT CGCGGAGCGC GGCGCTAGGA 600

ATGCCCTCGG CGGCCACCAG GAGGCACCGG AGAGAGAAAG GCGGTCCGAG GAGCCTCCCA 660 

TCTCCCTGGA TCTCACCTTC CACCTCCTCC GGGAAGTCTT GGAAATGGCC AGGGCCGAGC 720 

AGTTAGCACA GCAAGCTCAC AGCAACAGGA AACTCATGGA GATTATTGGG AAATAAAACG 780 

GTGCGTTTGG CCAAAAAGAA TCTGCATTTA GCACAAAAAA AATTTAAAAA AATACAGTAT 840

TCTGTACCAT AGCGCTGCTC TTATGCCATT TGTTTATTTT TATATAGCTT GAAACATAGA 900 

GGGAGAGAGG GAGAGAGCCT ATACCCCTTA CTTAGCATGC ACAAAGTGTA TTCACGTGCA 960 

GCAGCAACAC AATGTTATTC GTTTTGTCTA CGTTTAGTTT CCGTTTCCAG GTGTTTATAG 1020 

TGGTGTTTTA AAGAGAATGT AGACCTGTGA GAAAACGTTT TGTTTGAAAA AGCAGACAGA 1080 

AGTCACTCAA TTGTTTTTGT TGTGGTCTGA GCCAAAGAGA ATGCCATTCT CTTGGGTGGG 1140

TAAGACTAAA TCTGTAAGCT CTTTGAAACA ACTTTCTCTT GTAAACGTTT CAGTAATAAA 1200 

ACATCTTTCC AGTCCTTGGT CAGTTTGGTT GTGTAAGAGA ATGTTGAATA CTTATATTTT 1260 
TAATAAAAGT TGCAAAGGT 

Seq ID NO: 91 Protein Sequence

Protein Accession #: NP_000747.1

1 11 21 31 41 51

I I I I I I

MRLPLLVSAG VLLVALLPCP PCRALLSRGP VPGARQAPQH PQPLDFFQPP PQSEQPQQPQ 60

ARPVLLRMGE EYFLRLGNLN KSPAAPLSPA SSLLAGGSGS RPSPEQATAN FFRVLLQQLL 120 

LPRRSLDSPA ALAERGARNA LGGHQEAPER ERRSEEPPIS LDLTFHLLRE VLEMARAEQL 180
AQQAHSNRKL MEIIGK 

Seq ID NO: 92 DNA sequence

Nucleic Acid Accession #: NM_004217.1
Coding sequence: 58..1092

1 11 21 31 41 51

I I I I I I

GGCCGGGAGA GTAGCAGTGC CTTGGACCCC AGCTCTCCTC CCGCTTTCTC TCTAAGGATG 60 

GCCCAGAAGG AGAACTCCTA CCCCTGGCCC TACGGCCGAC AGACGGCTCC ATCTGGCCTG 120 

AGCACCCTGC CCCAGCGAGT CCTCCGGAAA GAGCCTGTCA CCCCATCTGC ACTTGTCCTC 180 

ATGAGCCGCT CCAATGTCCA GCCCACAGCT GCCCCTGGCC AGAAGGTGAT GGAGAATAGC 240 

AGTGGGACAC CCGACATCTT AACGCGGCAC TTCACAATTG ATGACTTTGA GATTGGGCGT 300 

CCTCTGGGCA AAGGCAAGTT TGGAAACGTG TACTTGGCTC GGGAGAAGAA AAGCCATTTC 360 

ATCGTGGCGC TCAAGGTCCT CTTCAAGTCC CAGATAGAGA AGGAGGGCGT GGAGCATCAG 420 

CTGCGCAGAG AGATCGAAAT CCAGGCCCAC CTGCACCATC CCAACATCCT GCGTCTCTAC 480 

AACTATTTTT ATGACCGGAG GAGGATCTAC TTGATTCTAG AGTATGCCCC CCGCGGGGAG 540 

CTCTACAAGG AGCTGCAGAA GAGCTGCACA TTTGACGAGC AGCGAACAGC CACGATCATG 600 

GAGGAGTTGG CAGATGCTCT AATGTACTGC CATGGGAAGA AGGTGATTCA CAGAGACATA 660 

AAGCCAGAAA ATCTGCTCTT AGGGCTCAAG GGAGAGCTGA AGATTGCTGA CTTCGGCTGG 720 

TCTGTGCATG CGCCCTCCCT GAGGAGGAAG ACAATGTGTG GCACCCTGGA CTACCTGCCC 780 

CCAGAGATGA TTGAGGGGCG CATGCACAAT GAGAAGGTGG ATCTGTGGTG CATTGGAGTG 840 

CTTTGCTATG AGCTGCTGGT GGGGAACCCA CCCTTTGAGA GTGCATCACA CAACGAGACC 900 

TATCGCCGCA TCGTCAAGGT GGACCTAAAG TTCCCCGCTT CTGTGCCCAC GGGAGCCCAG 960 

GACCTCATCT CCAAACTGCT CAGGCATAAC CCCTCGGAAC GGCTGCCCCT GGCCCAGGTC 1020 

TCAGCCCACC CTTGGGTCCG GGCCAACTCT CGGAGGGTGC TGCCTCCCTC TGCCCTTCAA 1080 

TCTGTCGCCT GATGGTCCCT GTCATTCACT CGGGTGCGTG TGTTTGTATG TCTGTGTATG 1140 

TATAGGGGAA AGAAGGGATC CCTAACTGTT CCCTTATCTG TTTTCTACCT CCTCCTTTGT 1200 
TTAATAAAGG CTGAAGCTTT TTGT 






Seq ID NO: 93 Protein Sequence 
Protein Accession #: NP_004208



1 11 21 31 41 51 

I I I I I I 

MAQKENSYPW PYGRQTAPSG LSTLPQRVLR KEPVTPSALV LMSRSNVQPT AAPGQKVMEN 60 

SSGTPDILTR HFTIDDFEIG RPLGKGKFGN VYLAREKKSH FIVALKVLFK SQIEKEGVEH 120

QLRREIEIQA HLHHPNILRL YNYFYDRRRI YLILEYAPRG ELYKELQKSC TFDEQRTATI 180

MEELADALMY CHGKKVIHRD IKPENLLLGL KGELKIADFG WSVHAPSLRR KTMCGTLDYL 240 

PPEMIEGRMH NEKVDLWCIG VLCYELLVGN PPFESASHNE TYRRIVKVDL KFPASVPTGA 300 
QDLISKLLRH NPSERLPLAQ VSAHPWVRAN SRRVLPPSAL QSVA 

Seq ID NO: 94 DNA sequence

Nucleic Acid Accession #: NM_007019.1 
Coding sequence: 41.. 580 

1 11 21 31 41 51

I I I I I I

GGCACGAGCG AGTTCCTGTC TCTCTGCCAA CGCCGCCCGG ATGGCTTCCC AAAACCGCGA 60 

CCCAGCCGCC ACTAGCGTCG CCGCCGCCCG TAAAGGAGCT GAGCCGAGCG GGGGCGCCGC 120 

CCGGGGTCCG GTGGGCAAAA GGCTACAGCA GGAGCTGATG ACCCTCATGA TGTCTGGCGA 180 

TAAAGGGATT TCTGCCTTCC CTGAATCAGA CAACCTTTTC AAATGGGTAG GGACCATCCA 240 

TGGAGCAGCT GGAACAGTAT ATGAAGACCT GAGGTATAAG CTCTCGCTAG AGTTCCCCAG 300

TGGCTACCCT TACAATGCGC CCACAGTGAA GTTCCTCACG CCCTGCTATC ACCCCAACGT 360 

GGACACCCAG GGTAACATAT GCCTGGACAT CCTGAAGGAA AAGTGGTCTG CCCTGTATGA 420 

TGTCAGGACC ATTCTGCTCT CCATCCAGAG CCTTCTAGGA GAACCCAACA TTGATAGTCC 480 



CTTGAACACA CATGCTGCCG AGCTCTGGAA AAACCCCACA GCTTTTAAGA AGTACCTGCA 540
AGAAACCTAC TCAAAGCAGG TCACCAGCCA GGAGCCCTGA CCCAGGCTGC CCAGCCTGTC 600 
CTTGTGTCGT CTTTTTAATT TTTCCTTAGA TGGTCTGTCC TTTTTGTGAT TTCTGTATAG 660 
GACTCTTTAT CTTGAGCTGT GGTATTTTTG TTTTGTTTTT GTCTTTTAAA TTAAGCCTCG 720 
GTTGAGCCCT TGTATATTAA ATAAATGCAT TTTTGTCCTT TTTTAAAAAA AAAAAAAAAA 780 
AAA 

Seq ID NO: 95 Protein Sequence 
Protein Accession #: NP_008950.1 

1 11 21 31 41 51

I I I I I I

MASQNRDPAA TSVAAARKGA EPSGGAARGP VGKRLQQELM TLMMSGDKGI SAFPESDNLF 60
KWVGTIHGAA GTVYEDLRYK LSLEFPSGYP YNAPTVKFLT PCYHPNVDTQ GNICLDILKE 120
KWSALYDVRT ILLSIQSLLG EPNIDSPLNT HAAELWKNPT AFKKYLQETY SKQVTSQEP 179



Seq ID NO: 96 DNA sequence 
Nucleic Acid Accession #: AK055663 
Coding sequence: 38.. 1423 

1 ii 21 31 41 51 

I I I I I I 

AGAACGGCTT CCGGCGGGAG CTGTGCAGCT CCTTATCATG GGGACAATTC ATCTCTTTCG 60 

AAAACCACAA AGATCCTTTT TTGGCAAGTT GTTACGGGAA TTTAGACTTG TAGCAGCTGA 120 

CCGAAGGTCC TGGAAGATAC TGCTCTTTGG TGTAATAAAC TTGATATGTA CTGGCTTCCT 180 

GCTTATGTGG TGCAGTTCTA CTAATAGTAT AGCTTTAACT GCCTATACTT ACCTGACCAT 240 

TTTTGATCTT TTTAGTTTAA TGACATGTTT AATAAGTTAC TGGGTAACAT TGAGGAAACC 300 

TAGCCCTGTC TATTCATTTG GGTTTGAAAG ATTAGAAGTC CTGGCTGTAT TTGCCTCCAC 360 

AGTCTTGGCA CAGTTGGGAG CTCTCTTTAT ATTAAAAGAA AGTGCAGAAC GCTTTTTGGA 420 

ACAGCCCGAG ATACACACGG GAAGATTATT AGTTGGTACT TTTGTGGCTC TTTGTTTCAA 480 

CCTGTTCACG ATGCTTTCTA TTCGGAATAA ACCTTTTGCT TATGTCTCAG AAGCTGCTAG 540 

TACGAGCTGG CTTCAAGAGC ATGTTGCAGA TCTTAGTCGA AGCTTGTGTG GAATTATTCC 600 

GGGACTTAGC AGTATCTTCC TTCCCCGAAT GAATCCATTT GTTTTGATTG ATCTTGCTGG 660 

AGCATTTGCT CTTTGTATTA CATATATGCT CATTGAAATT AATAATTATT TTGCCGTAGA 720 

CACTGCCTCT GCTATAGCTA TTGCCTTGAT GACATTTGGC ACTATGTATC CCATGAGTGT 780 

GTACAGTGGG AAAGTCTTAC TCCAGACAAC ACCACCCCAT GTTATTGGTC AGTTGGACAA 840 

ACTCATCAGA GAGGTATCTA CCTTAGATGG AGTTTTAGAA GTCCGAAATG AACATTTTTG 900 

GACCCTAGGT TTTGGCTCAT TGGCTGGATC AGTGCATGTA AGAATTCGAC GAGATGCCAA 960 

TGAACAAATG GTTCTTGCTC ATGTGACCAA CAGGCTGTAC ACTCTAGTGT CTACTCTAAC 1020 

TGTTCAAATT TTCAAGGATG ACTGGATTAG GCCTGCCTTA TTGTCTGGGC CTGTTGCAGC 1080 

CAATGTCCTA AACTTTTCAG ATCATCACGT AATCCCAATG CCTCTTTTAA AGGGTACTGA 1140 

TGATTTGAAC CCAGTTACAT CAACTCCAGC TAAACCTAGT AGTCCACCTC CAGAATTTTC 1200 

ATTTAACACT CCTGGGAAAA ATGTGAACCC AGTTATTCTT CTAAACACAC AAACAAGGCC 1260 

TTATGGTTTT GGTCTCAATC ATGGACACAC ACCTTACAGC AGCATGCTTA ATCAAGGACT 1320 

TGGAGTTCCA GGAATTGGAG CAACTCAAGG ATTGAGGACT GGTTTTACAA ATATACCAAG 1380 

TAGATATGGA ACTAATAATA GAATTGGACA ACCAAGACCA TGATAGACTC TAACTTATTT 1440 

TTATAAGGAA TATTGACTCC TTGGCTTCCA ATTTATTTAG TAATCCAACT TTGCATTGAC 1500 

TGTTTAATCA TTTACTCTAA ATGTTAGATA ATAGTAGTCT TGTTCACATT TCATGAAACC 1560 

TATGAAACTA TATTTTTGTA AAATGTATTT GTGACAGTGA AATCCTCGTA AATGTTAAAG 1620 

GCTTTAAATA GGCTTCCTTT AGAAAATGTG TTTCTTTAAA TTTGGATTTT GGTATCTTTG 1680

GTTTTGTAGT TGACTGCAGT GTGATGTGAC CTTACCTTTA TAAGAGCCAC TTGATGGAGT 1740 
AGATCTGTCA CATTACTAAG ATACGATATT TCTTTTTTTT TCCGAGACGG AGTCTTGCTC 1800 
TGCCACTGTG CCCGGCCAAT ACATTATTAT TAACTTAAGG CTGTACTTTA TTAAGGCTTC 1860
CTTAGTTTTT GTTTTGTTTT GTTTTTTGAG ATGGAGTCTC ACTCTGTCGC CCAGGCTGGA 1920
ATGCAGTGGC ATGATCTCAG CTCACTGCAA CCTCTGCCTC CTGAGTTCAA ATGATTCTCC 1980
TGCCTCAGCC TCCCGAGTAG CTGGGATTAC AGGCACCTGC CACCACGCCC AGCTAATTTT 2040
TGTATTTTTA GTAAAGACGG GGGATTTCAC CATGTTGGCC AGGCTGGTCT TGAACTCCTG 2100
zvrrTPATGAT CCACCCACCT TAGCCTCCCA AAGTGCTGGG ATTAGGTGTG AGCCACCGCA 2160
ATTTTCTTTA ATGAAATTTA TAAATATGCT TCTTGAATAA TACACATTTT 2220
GGGAAAGGGA AAAATGTCTG TTCAAAAAGT AAAGGTCTCT TTTATAGCTT TTCCAAACTT 2280
AATTGCTAAA TTTTTCTTTG AGGTTCTCCT GAATTATGTC TTACAAACTA AAAGCAAAAA 2340
TTTTTAGCAG AAATTTTGGA ATACATTCTA TCTAGCACAA TTTGAATTTT TAATTATCAA 2400
GATTTTTGTT AAAGTTTCTC TCCTTTAAAA ATTTTAGTAC ATTTOTAAAT



Seq ID NO: 97 Protein Sequence 
Protein Accession #: BAB70980.1

1 11 21 31 41 51

MGTIHLFRKP QRSFFGKLLR EFRLVAADRR SWKILLFGVI NLICTGFLLM WCSSTNSIAL 60

TAYTYLTIFD LFSLMTCLIS YWVTLRKPSP VYSFGFERLE VLAVFASTVL AQLGALFILK 120

ESAERFLEQP EIHTGRLLVG TFVALCFNLF TMLSIRNKPF AYVSEAASTS WLQEHVADLS 180 

RSLCGIIPGL SSIFLPRMNP FVLIDLAGAF ALCITYMLIE INNYFAVDTA SAIAIALMTF 240 

GTMYPMSVYS GKVLLQTTPP HVIGQLDKLI REVSTLDGVL EVRNEHFWTL GFGSLAGSVH 300 

VRIRRDANEQ MVLAHVTNRL YTLVSTLTVQ IFKDDWIRPA LLSGPVAANV LNFSDHHVIP 360 

MPLLKGTDDL NPVTSTPAKP SSPPPEFSFN TPGKNVNPVI LLNTQTRPYG FGLNHGHTPY 420 
SSMLNQGLGV PGIGATQGLR TGFTNIPSRY GTNNRIGQPR P 






Seq ID NO: 98 DNA sequence

Nucleic Acid Accession #: CAT cluster

1 11 21 31 41 51

I I I I I I

ATGGACTTAA CCAATGCTTC ACAAATACCA AGAAAAAAAT ATTTCAATGT GATAAATATA 60

GGAAAGTTTT TCATAAATTT TCAAATTCAA ACTGACATAA GAGAAGACAT ACGGGAAAAA 120 

AGACCTTTCA AATGTATAGA ATGTGGCAAA GCTTTTAACA AGTTTTCAAC CCTTACTACA 180 

CATAAGAAAA TTCATACTAG AGAGAAACCC TACAAATGTG AAGAATGTGG CAAAGCCTTC 240

AACTGGTCCT CATACCTTAC TATACATAAG AGAATTCATA CTGGAGAGAA ACTCTACAAA 300

TGTGAAGAAT GTGGCAAAGC CTTCAACTGG TCCTCATACC TTACTGCATA TAAGATAACT 360

CATACTAGAG AGAAACCCTA CAAATGTGAA GAATGTGGCA AAGCCTTTAA GTACTCCTCG 420

AACCTCACTA CACATAAGAT AATTCATACT GGAGAGCATC TCTACAAGTG TGAAGAATGT 480
GGCAAAGCGT TTAACCATGC TGCATGCTTC TTTGTCATTT TGAAGATTTG A 






Seq ID NO: 99 DNA sequence 

Nucleic Acid Accession #: FGENESH predicted 

Coding sequence: 1..531 



1 11 21 31 41 51

I I I I I I 

ATGGACTTAA CCAATGCTTC ACAAATACCA AGAAAAAAAT ATTTCAATGT GATAAATATA 60 

GGAAAGTTTT TCATAAATTT TCAAATTCAA ACTGACATAA GAGAAGACAT ACGGGAAAAA 120 

AGACCTTTCA AATGTATAGA ATGTGGCAAA GCTTTTAACA AGTTTTCAAC CCTTACTACA 180 

CATAAGAAAA TTCATACTAG AGAGAAACCC TACAAATGTG AAGAATGTGG CAAAGCCTTC 240

AACTGGTCCT CATACCTTAC TATACATAAG AGAATTCATA CTGGAGAGAA ACTCTACAAA 300 

TGTGAAGAAT GTGGCAAAGC CTTCAACTGG TCCTCATACC TTACTGCATA TAAGATAACT 360 

CATACTAGAG AGAAACCCTA CAAATGTGAA GAATGTGGCA AAGCCTTTAA GTACTCCTCG 420 

AACCTCACTA CACATAAGAT AATTCATACT GGAGAGCATC TCTACAAGTG TGAAGAATGT 480 
GGCAAAGCGT TTAACCATGC TGCATGCTTC TTTGTCATTT TGAAGATTTG A 



Seq ID NO: 100 Protein Sequence 
Protein Accession #: FGENESH predicted 

1 11 21 31 41 51 

I I I I I I 

MDLTNASQIP RKKYFNVINI GKFFINFQIQ TDIREDIREK RPFKCIECGK AFNKFSTLTT 60 
HKKIHTREKP YKCEECGKAF NWSSYLTIHK RIHTGEKLYK CEECGKAFNW SSYLTAYKIT 120 
HTREKPYKCE ECGKAFKYSS NLTTHKIIHT GEHLYKCEEC GKAFNHAACF FVILKI
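
The relationship between a DNA entry, its "Coding sequence" range and the paired protein entry can be checked mechanically. The following is a minimal Python sketch and is not part of the original listing. It assumes the standard genetic code and treats the coding range as 1-based and inclusive; the helper names `clean` and `translate_cds` are hypothetical and chosen only for illustration.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Drop spaces, position numbers and line breaks so only bases remain."""
    return "".join(ch for ch in seq.upper() if ch in "ACGT")


def translate_cds(dna: str, start: int, end: int) -> str:
    """Translate the 1-based, inclusive range start..end, stopping at a stop codon."""
    cds = clean(dna)[start - 1:end]
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First row of Seq ID NO: 99 as printed in the listing (positions 1-60).
row = "ATGGACTTAA CCAATGCTTC ACAAATACCA AGAAAAAAAT ATTTCAATGT GATAAATATA 60"
print(translate_cds(row, 1, 60))  # prints MDLTNASQIPRKKYFNVINI
```

The printed residues agree with the first two blocks of Seq ID NO: 100. Applying the same function to a full DNA entry and its stated coding range is one way to spot transcription errors in either the nucleotide or the protein listing.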

Seq ID NO: 101 DNA sequence 

Nucleic Acid Accession #: NM_032589.1

Coding sequence: 147-422 

1 11 21 31 41 51 

I I I I I 1 

GGCACGAGGT CTTGTTATGC AAATAGGCTT CCCACTTGGC AGGGGCCGTC TTGTCCACTC 60 
GTTTCTGTAA ACATGGGTGG CAAAAAGAGA AGATGGAGCT GCCATTTAGA ACATGCCTAA 120 
TCCCAGCTTC ATCTTGCTGA GCAAAAATGA AGGAGCCTGG ACCCAACTTT GTTACTGTGA 180 
GAAAGGGTCT TCATTCATTC AAGATGGCAT TTGTTAAGCA CCTACTGCTG GAGTGCAGTG 240 
GTTCAATCAC GGATCACTGC AGCCTCCACC TCCCAGTTCA AGAAATTCTC ATGTCTCAGC 300 
CTCCTGAGCA GCTAGGATTA CAGACAAACC TTGGAAATCA AGAAAGTTCT GGAATGATGA 360 
AGCTGTTCAT GCCAAGACCG AAAGTGCTGG CCCAGTATGA GTCCATTCAG TTCATGCCGT 420 
GACAATTTTC TTGGAACTCC TTTTTATTGT TAGTTCTCAC TTGTTTCCAT ATTTAGTGAA 480 
TGTACATTTA ATTGCAAAGC TGTCATTAAT AAAAATTCTT ATAGTACCTC ACTGCAC 



Seq ID NO: 102 Protein sequence: 
Protein Accession #: NP_115978.1 

1 11 21 31 41 51 

I I I 1 I I 

MKEPGPNFVT VRKGLHSFKM AFVKHLLLEC SGSITDHCSL HLPVQEILMS QPPEQLGLQT 60 
NLGNQESSGM MKLFMPRPKV LAQYESIQFM P 

Seq ID NO: 103 DNA sequence 

Nucleic Acid Accession #: NM_014501.1 

Coding sequence: 60..737

1 11 21 31 41 51

I I I I I I 

GGCGGACCGA AGAACGCAGG AAGGGGGCCG GGGGGACCCG CCCCCGGCCG GCCGCAGCCA 60 

TGAACTCCAA CGTGGAGAAC CTACCCCCGC ACATCATCCG CCTGGTGTAC AAGGAGGTGA 120 

CGACACTGAC CGCAGACCCA CCCGATGGCA TCAAGGTCTT TCCCAACGAG GAGGACCTCA 180 

CCGACCTCCA GGTCACCATC GAGGGCCCTG AGGGGACCCC ATATGCTGGA GGTCTGTTCC 240

GCATGAAACT CCTGCTGGGG AAGGACTTCC CTGCCTCCCC ACCCAAGGGC TACTTCCTGA 300 

CCAAGATCTT CCACCCGAAC GTGGGCGCCA ATGGCGAGAT CTGCGTCAAC GTGCTCAAGA 360 

GGGACTGGAC GGCTGAGCTG GGCATCCGAC ACGTACTGCT GACCATCAAG TGCCTGCTGA 420 

TCCACCCTAA CCCCGAGTCT GCACTCAACG AGGAGGCGGG CCGCCTGCTC TTGGAGAACT 480 

ACGAGGAGTA TGCGGCTCGG GCCCGTCTGC TCACAGAGAT CCACGGGGGC GCCGGCGGGC 540 

CCAGCGGCAG GGCCGAAGCC GGTCGGGCCC TGGCCAGTGG CACTGAAGCT TCCTCCACCG 600 

ACCCTGGGGC CCCAGGGGGC CCGGGAGGGG CTGAGGGTCC CATGGCCAAG AAGCATGCTG 660 

GCGAGCGCGA TAAGAAGCTG GCGGCCAAGA AAAAGACGGA CAAGAAGCGG GCGCTGCGGG 720 

CGCTGCGGCG GCTGTAGTGG GCTCTCTTCC TCCTTCCACC GTGACCCCAA CCTCTCCTGT 780 

CCCCTCCCTC CAACTCTGTC TCTAAGTTAT TTAAATTATG GCTGGGGTCG GGGAGGGTAC 840 
AGGGGGCACT GGGACCTGGA TTTGTTTTTC TAAATAAAGT TGGAAAAGCA 

Seq ID NO: 104 Protein Sequence 
Protein Accession #: NP_055316.1 

1 11 21 31 41 51 

| | | | | |

MNSNVENLPP HIIRLVYKEV TTLTADPPDG IKVFPNEEDL TDLQVTIEGP EGTPYAGGLF 60
RMKLLLGKDF PASPPKGYFL TKIFHPNVGA NGEICVNVLK RDWTAELGIR HVLLTIKCLL 120 
IHPNPESALN EEAGRLLLEN YEEYAARARL LTEIHGGAGG PSGRAEAGRA LASGTEASST 180 
DPGAPGGPGG AEGPMAKKHA GERDKKLAAK KKTDKKRALR ALRRL 



Seq ID NO: 105 DNA sequence 
Nucleic Acid Accession #: NM_005101 
Coding sequence: 76..573



1 11 21 31 41 51

| | | | | |

CGGCTGAGAG GCAGCGAACT CATCTTTGCC AGTACAGGAG CTTGTGCCGT GGCCCACAGC 60
CCACAGCCCA CAGCCATGGG CTGGGACCTG ACGGTGAAGA TGCTGGCGGG CAACGAATTC 120
CAGGTGTCCC TGAGCAGCTC CATGTCGGTG TCAGAGCTGA AGGCGCAGAT CACCCAGAAG 180
ATTGGCGTGC ACGCCTTCCA GCAGCGTCTG GCTGTCCACC CGAGCGGTGT GGCGCTGCAG 240
GACAGGGTCC CCCTTGCCAG CCAGGGCCTG GGCCCTGGCA GCACGGTCCT GCTGGTGGTG 300
GACAAATGCG ACGAACCTCT GAGCATCCTG GTGAGGAATA ACAAGGGCCG CAGCAGCACC 360
TACGAGGTCC GGCTGACGCA GACCGTGGCC CACCTGAAGC AGCAAGTGAG CGGGCTGGAG 420
GGTGTGCAGG ACGACCTGTT CTGGCTGACC TTCGAGGGGA AGCCCCTGGA GGACCAGCTC 480
CCGCTGGGGG AGTACGGCCT CAAGCCCCTG AGCACCGTGT TCATGAATCT GCGCCTGCGG 540
GGAGGCGGCA CAGAGCCTGG CGGGCGGAGC TAAGGGCCTC CACCAGCATC CGAGCAGGAT 600
CAAGGGCCGG AAATAAAGGC TGTTGTAAGA GAAT

Seq ID NO: 106 Protein Sequence 
Protein Accession #: NP_005092

1 11 21 31 41 51

| | | | | |

MGWDLTVKML AGNEFQVSLS SSMSVSELKA QITQKIGVHA FQQRLAVHPS GVALQDRVPL 60
ASQGLGPGST VLLVVDKCDE PLSILVRNNK GRSSTYEVRL TQTVAHLKQQ VSGLEGVQDD 120
LFWLTFEGKP LEDQLPLGEY GLKPLSTVFM NLRLRGGGTE PGGRS

Seq ID NO: 107 DNA sequence 
Nucleic Acid Accession #: NM_006018 
Coding sequence: 1..1161 



1 11 21 31 41 51

| | | | | |

ATGAATCGGC ACCATCTGCA GGATCACTTT CTGGAAATAG ACAAGAAGAA CTGCTGTGTG 60
TTCCGAGATG ACTTCATTGC CAAGGTGTTG CCGCCGGTGT TGGGGCTGGA GTTTATCTTT 120
GGGCTTCTGG GCAATGGCCT TGCCCTGTGG ATTTTCTGTT TCCACCTCAA GTCCTGGAAA 180
TCCAGCCGGA TTTTCCTGTT CAACCTGGCA GTAGCTGACT TTCTACTGAT CATCTGCCTG 240
CCGTTCGTGA TGGACTACTA TGTGCGGCGT TCAGACTGGA AGTTTGGGGA CATCCCTTGC 300
CGGCTGGTGC TCTTCATGTT TGCCATGAAC CGCCAGGGCA GCATCATCTT CCTCACGGTG 360
GTGGCGGTAG ACAGGTATTT CCGGGTGGTC CATCCCCACC ACGCCCTGAA CAAGATCTCC 420
AATTGGACAG CAGCCATCAT CTCTTGCCTT CTGTGGGGCA TCACTGTTGG CCTAACAGTC 480
CACCTCCTGA AGAAGAAGTT GCTGATCCAG AATGGCCCTG CAAATGTGTG CATCAGCTTC 540
AGCATCTGCC ATACCTTCCG GTGGCACGAA GCTATGTTCC TCCTGGAGTT CCTCCTGCCC 600
CTGGGCATCA TCCTGTTCTG CTCAGCCAGA ATTATCTGGA GCCTGCGGCA GAGACAAATG 660
GACCGGCATG CCAAGATCAA GAGAGCCATC ACCTTCATCA TGGTGGTGGC CATCGTCTTT 720
GTCATCTGCT TCCTTCCCAG CGTGGTTGTG CGGATCCGCA TCTTCTGGCT CCTGCACACT 780
TCGGGCACGC AGAATTGTGA AGTGTACCGC TCGGTGGACC TGGCGTTCTT TATCACTCTC 840
AGCTTCACCT ACATGAACAG CATGCTGGAC CCCGTGGTGT ACTACTTCTC CAGCCCATCC 900
TTTCCCAACT TCTTCTCCAC TTTGATCAAC CGCTGCCTCC AGAGGAAGAT GACAGGTGAG 960
CCAGATAATA ACCGCAGCAC GAGCGTCGAG CTCACAGGGG ACCCCAACAA AACCAGAGGC 1020
GCTCCAGAGG CGTTAATGGC CAACTCCGGT GAGCCATGGA GCCCCTCTTA TCTGGGCCCA 1080
ACCTCAAATA ACCATTCCAA GAAGGGACAT TGTCACCAAG AACCAGCATC TCTGGAGAAA 1140
CAGTTGGGAT GTTGCATCGA G

Seq ID NO: 108 Protein Sequence
Protein Accession #: NP_006009.1

1 11 21 31 41 51

| | | | | |

MNRHHLQDHF LEIDKKNCCV FRDDFIAKVL PPVLGLEFIF GLLGNGLALW IFCFHLKSWK 60 

SSRIFLFNLA VADFLLIICL PFVMDYYVRR SDWKFGDIPC RLVLFMFAMN RQGSIIFLTV 120 

VAVDRYFRVV HPHHALNKIS NWTAAIISCL LWGITVGLTV HLLKKKLLIQ NGPANVCISF 180

SICHTFRWHE AMFLLEFLLP LGIILFCSAR IIWSLRQRQM DRHAKIKRAI TFIMVVAIVF 240

VICFLPSVVV RIRIFWLLHT SGTQNCEVYR SVDLAFFITL SFTYMNSMLD PVVYYFSSPS 300

FPNFFSTLIN RCLQRKMTGE PDNNRSTSVE LTGDPNKTRG APEALMANSG EPWSPSYLGP 360 
TSNNHSKKGH CHQEPASLEK QLGCCIE 

Seq ID NO: 109 DNA sequence

Nucleic Acid Accession #: NM_005030.2
Coding sequence: 63..1874

1 11 21 31 41 51

| | | | | |

GGCACGAGGG AGCGGTGCGG AGGCTCTGCT CGGATCGAGG TCTGCAGCGC AGCTTCGGGA 60 

GCATGAGTGC TGCAGTGACT GCAGGGAAGC TGGCACGGGC ACCGGCCGAC CCTGGGAAAG 120 

CCGGGGTCCC CGGAGTTGCA GCTCCCGGAG CTCCGGCGGC GGCTCCACCG GCGAAAGAGA 180 

TCCCGGAGGT CCTAGTGGAC CCACGCAGCC GGCGGCGCTA TGTGCGGGGC CGCTTTTTGG 240 

GCAAGGGCGG CTTTGCCAAG TGCTTCGAGA TCTCGGACGC GGACACCAAG GAGGTGTTCG 300

CGGGCAAGAT TGTGCCTAAG TCTCTGCTGC TCAAGCCGCA CCAGAGGGAG AAGATGTCCA 360 

TGGAAATATC CATTCACCGC AGCCTCGCCC ACCAGCACGT CGTAGGATTC CACGGCTTTT 420 

TCGAGGACAA CGACTTCGTG TTCGTGGTGT TGGAGCTCTG CCGCCGGAGG TCTCTCCTGG 480 

AGCTGCACAA GAGGAGGAAA GCCCTGACTG AGCCTGAGGC CCGATACTAC CTACGGCAAA 540 

TTGTGCTTGG CTGCCAGTAC CTGCACCGAA ACCGAGTTAT TCATCGAGAC CTCAAGCTGG 600

GCAACCTTTT CCTGAATGAA GATCTGGAGG TGAAAATAGG GGATTTTGGA CTGGCAACCA 660 

AAGTCGAATA TGACGGGGAG AGGAAGAAGA CCCTGTGTGG GACTCCTAAT TACATAGCTC 720 

CCGAGGTGCT GAGCAAGAAA GGGCACAGTT TCGAGGTGGA TGTGTGGTCC ATTGGGTGTA 780 

TCATGTATAC CTTGTTAGTG GGCAAACCAC CTTTTGAGAC TTCTTGCCTA AAAGAGACCT 840 

ACCTCCGGAT CAAGAAGAAT GAATACAGTA TTCCCAAGCA CATCAACCCC GTGGCCGCCT 900 

CCCTCATCCA GAAGATGCTT CAGACAGATC CCACTGCCCG CCCAACCATT AACGAGCTGC 960 

TTAATGACGA GTTCTTTACT TCTGGCTATA TCCCTGCCCG TCTCCCCATC ACCTGCCTGA 1020 

CCATTCCACC AAGGTTTTCG ATTGCTCCCA GCAGCCTGGA CCCCAGCAAC CGGAAGCCCC 1080 

TCACAGTCCT CAATAAAGGC TTGGAGAACC CCCTGCCTGA GCGTCCCCGG GAAAAAGAAG 1140 

AACCAGTGGT TCGAGAGACA GGTGAGGTGG TCGACTGCCA CCTCAGTGAC ATGCTGCAGC 1200 

AGCTGCACAG TGTCAATGCC TCCAAGCCCT CGGAGCGTGG GCTGGTCAGG CAAGAGGAGG 1260 

CTGAGGATCC TGCCTGCATC CCCATCTTCT GGGTCAGCAA GTGGGTGGAC TATTCGGACA 1320 

AGTACGGCCT TGGGTATCAG CTCTGTGATA ACAGCGTGGG GGTGCTCTTC AATGACTCAA 1380 

CACGCCTCAT CCTCTACAAT GATGGTGACA GCCTGCAGTA CATAGAGCGT GACGGCACTG 1440 

AGTCCTACCT CACCGTGAGT TCCCATCCCA ACTCCTTGAT GAAGAAGATC ACCCTCCTTA 1500 

AATATTTCCG CAATTACATG AGCGAGCACT TGCTGAAGGC AGGTGCCAAC ATCACGCCGC 1560 

GCGAAGGTGA TGAGCTCGCC CGGCTGCCCT ACCTACGGAC CTGGTTCCGC ACCCGCAGCG 1620 

CCATCATCCT GCACCTCAGC AACGGCAGCG TGCAGATCAA CTTCTTCCAG GATCACACCA 1680 

AGCTCATCTT GTGCCCACTG ATGGCAGCCG TGACCTACAT CGACGAGAAG CGGGACTTCC 1740 

GCACATACCG CCTGAGTCTC CTGGAGGAGT ACGGCTGCTG CAAGGAGCTG GCCAGCCGGC 1800 

TCCGCTACGC CCGCACTATG GTGGACAAGC TGCTGAGCTC ACGCTCGGCC AGCAACCGTC 1860 

TCAAGGCCTC CTAATAGCTG CCCTCCCCTC CGGACTGGTG CCCTCCTCAC TCCCACCTGC 1920 

ATCTGGGGCC CATACTGGTT GGCTCCCGCG GTGCCATGTC TGCAGTGTGC CCCCCAGCCC 1980 

CGGTGGCTGG GCAGAGCTGC ATCATCCTTG CAGGTGGGGG TTGCTGTGTA AGTTATTTTT 2040 

GTACATGTTC GGGTGTGGGT TCTACAGCCT TGTCCCCCTC CCCCTCAACC CCACCATATG 2100 

AATTGTACAG AATATTTCTA TTGAATTCGG AACTGTCCTT TCCTTGGCTT TATGCACATT 2160 
AAACAGATGT GAATATTCAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAA 



Seq ID NO: 110 Protein Sequence
Protein Accession #: NP_005021.2

1 11 21 31 41 51

| | | | | |

MSAAVTAGKL ARAPADPGKA GVPGVAAPGA PAAAPPAKEI PEVLVDPRSR RRYVRGRFLG 60

KGGFAKCFEI SDADTKEVFA GKIVPKSLLL KPHQREKMSM EISIHRSLAH QHVVGFHGFF 120

EDNDFVFVVL ELCRRRSLLE LHKRRKALTE PEARYYLRQI VLGCQYLHRN RVIHRDLKLG 180

NLFLNEDLEV KIGDFGLATK VEYDGERKKT LCGTPNYIAP EVLSKKGHSF EVDVWSIGCI 240

MYTLLVGKPP FETSCLKETY LRIKKNEYSI PKHINPVAAS LIQKMLQTDP TARPTINELL 300

NDEFFTSGYI PARLPITCLT IPPRFSIAPS SLDPSNRKPL TVLNKGLENP LPERPREKEE 360

PVVRETGEVV DCHLSDMLQQ LHSVNASKPS ERGLVRQEEA EDPACIPIFW VSKWVDYSDK 420

YGLGYQLCDN SVGVLFNDST RLILYNDGDS LQYIERDGTE SYLTVSSHPN SLMKKITLLK 480

YFRNYMSEHL LKAGANITPR EGDELARLPY LRTWFRTRSA IILHLSNGSV QINFFQDHTK 540

LILCPLMAAV TYIDEKRDFR TYRLSLLEEY GCCKELASRL RYARTMVDKL LSSRSASNRL 600

KAS

Seq ID NO: 111 DNA sequence 
Nucleic Acid Accession #: NM_005409.3
Coding sequence: 94..378

1 11 21 31 41 51

| | | | | |

TTCCTTTCAT GTTCAGCATT TCTACTCCTT CCAAGAAGAG CAGCAAAGCT GAAGTAGCAG 60 

CAACAGCACC AGCAGCAACA GCAAAAAACA AACATGAGTG TGAAGGGCAT GGCTATAGCC 120
TTGGCTGTGA TATTGTGTGC TACAGTTGTT CAAGGCTTCC CCATGTTCAA AAGAGGACGC 180
TGTCTTTGCA TAGGCCCTGG GGTAAAAGCA GTGAAAGTGG CAGATATTGA GAAAGCCTCC 240
ATAATGTACC CAAGTAACAA CTGTGACAAA ATAGAAGTGA TTATTACCCT GAAAGAAAAT 300
AAAGGACAAC GATGCCTAAA TCCCAAATCG AAGCAAGCAA GGCTTATAAT CAAAAAAGTT 360
GAAAGAAAGA ATTTTTAAAA ATATCAAAAC ATATGAAGTC CTGGAAAAGG GCATCTGAAA 420
AACCTAGAAC AAGTTTAACT GTGACTACTG AAATGACAAG AATTCTACAG TAGGAAACTG 480
AGACTTTTCT ATGGTTTTGT GACTTTCAAC TTTTGTACAG TTATGTGAAG GATGAAAGGT 540
GGGTGAAAGG ACCAAAAACA GAAATACAGT CTTCCTGAAT GAATGACAAT CAGAATTCCA 600
CTGCCCAAAG GAGTCCAGCA ATTAAATGGA TTTCTAGGAA AAGCTACCTT AAGAAAGGCT 660
GGTTACCATC GGAGTTTACA AAGTGCTTTC ACGTTCTTAC TTGTTGTATT ATACATTCAT 720
GCATTTCTAG GCTAGAGAAC CTTCTAGATT TGATGCTTAC AACTATTCTG TTGTGACTAT 780
GAGAACATTT CTGTCTCTAG AAGTTATCTG TCTGTATTGA TCTTTATGCT ATATTACTAT 840
CTGTGGTTAC AGTGGAGACA TTGACATTAT TACTGGAGTC AAGCCCTTAT AAGTCAAAAG 900
CATCTATGTG TCGTAAAGCA TTCCTCAAAC ATTTTTTCAT GCAAATACAC ACTTCTTTCC 960
CCAAATATCA TGTAGCACAT CAATATGTAG GGAAACATTC TTATGCATCA TTTGGTTTGT 1020
TTTATAACCA ATTCATTAAA TGTAATTCAT AAAATGTACT ATGAAAAAAA TTATACGCTA 1080
TGGGATACTG GCAACAGTGC ACATATTTCA TAACCAAATT AGCAGCACCG GTCTTAATTT 1140
GATGTTTTTC AACTTTTATT CATTGAGATG TTTTGAAGCA ATTAGGATAT GTGTGTTTAC 1200
TGTACTTTTT GTTTTGATCC GTTTGTATAA ATGATAGCAA TATCTTGGAC ACATTTGAAA 1260
TACAAAATGT TTTTGTCTAC CAAAGAAAAA TGTTGAAAAA TAAGCAAATG TATACCTAGC 1320
AATCACTTTT ACTTTTTGTA ATTCTGTCTC TTAGAAAAAT ACATAATCTA ATCAATTTCT 1380
TTGTTCATGC CTATATACTG TAAAATTTAG GTATACTCAA GACTAGTTTA AAGAATCAAA 1440
GTCATTTTTT TCTCTAATAA ACTACCACAA CCTTTCTTTT TTAAAAAAAA

Seq ID NO: 112 Protein Sequence
Protein Accession #: NP_005400.1

1 11 21 31 41 51

| | | | | |

MSVKGMAIAL AVILCATVVQ GFPMFKRGRC LCIGPGVKAV KVADIEKASI MYPSNNCDKI 60
EVIITLKENK GQRCLNPKSK QARLIIKKVE RKNF



Seq ID NO: 113 DNA sequence 

Nucleic Acid Accession #: NM_001110.1

Coding sequence: 470..2716



1 11 21 31 41 51

| | | | | |

GAATTCGAGG ATCCGGGTAC CATGGGCGGC GGCAGGCCTA GCAGCACGGG AACCGTCCCC 60
CGCGCGCATG CGCGCGCCCC TGAAGCGCCT GGGGGACGGG TATGGGCGGG AGGTAGGGGC 120
GCGGCTCCGC GTGCCAGTTG GGTGCCCGCG CGTCACGTGG TGAGGAAGGA GGCGGAGGTC 180
TGAGTTTCGA GGGAGGGGGG GAGAGAAGAG GGAACGAGCA AGGGAAGGAA AGCGGGGAAA 240
GGAGGAAGGA AACGAACGAG GGGGAGGGAG GTCCCTGTTT TGGAGGAGCT AGGAGCGTTG 300
CCGGCCCCTG AAGTGGAGCG AGAGGGAGGT GCTTCGCCGT TTCTCCTGCC AGGGGAGGTC 360
CCGGCTTCCC GTGGAGGCTC CGGACCAAGC CCCTTCAGCT TCTCCCTCCG GATCGATGTG 420
CTGCTGTTAA CCCGTGAGGA GGCGGCGGCG GCGGCAGCGG CAGCGGAAGA TGGTGTTGCT 480
GAGAGTGTTA ATTCTGCTCC TCTCCTGGGC GGCGGGGATG GGAGGTCAGT ATGGGAATCC 540
TTTAAATAAA TATATCAGAC ATTATGAAGG ATTATCTTAC AATGTGGATT CATTACACCA 600
AAAACACCAG CGTGCCAAAA GAGCAGTCTC ACATGAAGAC CAATTTTTAC GTCTAGATTT 660
CCATGCCCAT GGAAGACATT TCAACCTACG AATGAAGAGG GACACTTCCC TTTTCAGTGA 720
TGAATTTAAA GTAGAAACAT CAAATAAAGT ACTTGATTAT GATACCTCTC ATATTTACAC 780
TGGACATATT TATGGTGAAG AAGGAAGTTT TAGCCATGGG TCTGTTATTG ATGGAAGATT 840
TGAAGGATTC ATCCAGACTC GTGGTGGCAC ATTTTATGTT GAGCCAGCAG AGAGATATAT 900
TAAAGACCGA ACTCTGCCAT TTCACTCTGT CATTTATCAT GAAGATGATA TTAACTATCC 960
CCATAAATAC GGTCCTCAGG GGGGCTGTGC AGATCATTCA GTATTTGAAA GAATGAGGAA 1020
ATACCAGATG ACTGGTGTAG AGGAAGTAAC ACAGATACCT CAAGAAGAAC ATGCTGCTAA 1080
TGGTCCAGAA CTTCTGAGGA AAAAACGTAC AACTTCAGCT GAAAAAAATA CTTGTCAGCT 1140
TTATATTCAG ACTGATCATT TGTTCTTTAA ATATTACGGA ACACGAGAAG CTGTGATTGC 1200
CCAGATATCC AGTCATGTTA AAGCGATTGA TACAATTTAC CAGACCACAG ACTTCTCCGG 1260
AATCCGTAAC ATCAGTTTCA TGGTGAAACG CATAAGAATC AATACAACTG CTGATGAGAA 1320
GGACCCTACA AATCCTTTCC GTTTCCCAAA TATTGGTGTG GAGAAGTTTC TGGAATTGAA 1380
TTCTGAGCAG AATCATGATG ACTACTGTTT GGCCTATGTC TTCACAGACC GAGATTTTGA 1440
TGATGGCGTA CTTGGTCTGG CTTGGGTTGG AGCACCTTCA GGAAGCTCTG GAGGAATATG 1500
TGAAAAAAGT AAACTCTATT CAGATGGTAA GAAGAAGTCC TTAAACACTG GAATTATTAC 1560
TGTTCAGAAC TATGGGTCTC ATGTACCTCC CAAAGTCTCT CACATTACTT TTGCTCACGA 1620
AGTTGGACAT AACTTTGGAT CCCCACATGA TTCTGGAACA GAGTGCACAC CAGGAGAATC 1680
TAAGAATTTG GGTCAAAAAG AAAATGGCAA TTACATCATG TATGCAAGAG CAACATCTGG 1740
GGACAAACTT AACAACAATA AATTCTCACT CTGTAGTATT AGAAATATAA GCCAAGTTCT 1800
TGAGAAGAAG AGAAACAACT GTTTTGTTGA ATCTGGCCAA CCTATTTGTG GAAATGGAAT 1860
GGTAGAACAA GGTGAAGAAT GTGATTGTGG CTATAGTGAC CAGTGTAAAG ATGAATGCTG 1920
CTTCGATGCA AATCAACCAG AGGGAAGAAA ATGCAAACTG AAACCTGGGA AACAGTGCAG 1980
TCCAAGTCAA GGTCCTTGTT GTACAGCACA GTGTGCATTC AAGTCAAAGT CTGAGAAGTG 2040
TCGGGATGAT TCAGACTGTG CAAGGGAAGG AATATGTAAT GGCTTCACAG CTCTCTGCCC 2100
AGCATCTGAC CCTAAACCAA ACTTCACAGA CTGTAATAGG CATACACAAG TGTGCATTAA 2160
TGGGCAATGT GCAGGTTCTA TCTGTGAGAA ATATGGCTTA GAGGAGTGTA CGTGTGCCAG 2220
TTCTGATGGC AAAGATGATA AAGAATTATG CCATGTATGC TGTATGAAGA AAATGGACCC 2280
ATCAACTTGT GCCAGTACAG GGTCTGTGCA GTGGAGTAGG CACTTCAGTG GTCGAACCAT 2340
CACCCTGCAA CCTGGATCCC CTTGCAACGA TTTTAGAGGT TACTGTGATG TTTTCATGCG 2400
GTGCAGATTA GTAGATGCTG ATGGTCCTCT AGCTAGGCTT AAAAAAGCAA TTTTTAGTCC 2460
AGAGCTCTAT GAAAACATTG CTGAATGGAT TGTGGCTCAT TGGTGGGCAG TATTACTTAT 2520
GGGAATTGCT CTGATCATGC TAATGGCTGG ATTTATTAAG ATATGCAGTG TTCATACTCC 2580
AAGTAGTAAT CCAAAGTTGC CTCCTCCTAA ACCACTTCCA GGCACTTTAA AGAGGAGGAG 2640
ACCTCCACAG CCCATTCAGC AACCCCAGCG TCAGCGGCCC CGAGAGAGTT ATCAAATGGG 2700
ACACATGAGA CGCTAACTGC AGCTTTTGCC TTGGTTCTTC CTAGTGCCTA CAATGGGAAA 2760
ACTTCACTCC AAAGAGAAAC CTATTAAGTC ATCATCTCCA AACTAAACCC TCACAAGTAA 2820
CAGTTGAAGA AAAAATGGCA AGAGATCATA TCCTCAGACC AGGTGGAATT ACTTAAATTT 2880
TAAAGCCTGA AAATTCCAAT TTGGGGGTGG GAGGTGGAAA AGGAACCCAA TTTTCTTATG 2940
AACAGATATT TTTAACTTAA TGGCACAAAG TCTTAGAATA TTATTATGTG CCCCGTGTTC 3000
CCTGTTCTTC GTTGCTGCAT TTTCTTCACT TGCAGGCAAA CTTGGCTCTC AATAAACTTT 3060
TACCACAAAT TGAAATAAAT ATATTTTTTT CAACTGCCAA TCAAGGCTAG GAGGCTCGAC 3120
CACCTCAACA TTGGAGACAT CACTTGCCAA TGTACATACC TTGTTATATG CAGACATGTA 3180
TTTCTTACGT ACACTGTACT TCTGTGTGCA ATTGTAAACA GAAATTGCAA TATGGATGTT 3240
TCTTTGTATT ATAAAATTTT TCCGCTCTTA ATTAAAAATT ACTGTTTAAT TGACATACTC 3300
AGGATAACAG AGAATGGTGG TATTCAGTGG TCCAGGATTC TGTAATGCTT TACACAGGCA 3360
GTTTTGAAAT GAAAATCAAT TTACCCCATG GTACCCGGAT CCTCGAATTC

Seq ID NO: 114 Protein Sequence
Protein Accession #: NP_001101.1

1 11 21 31 41 51

| | | | | |

MVLLRVLILL LSWAAGMGGQ YGNPLNKYIR HYEGLSYNVD SLHQKHQRAK RAVSHEDQFL 60
RLDFHAHGRH FNLRMKRDTS LFSDEFKVET SNKVLDYDTS HIYTGHIYGE EGSFSHGSVI 120
DGRFEGFIQT RGGTFYVEPA ERYIKDRTLP FHSVIYHEDD INYPHKYGPQ GGCADHSVFE 180
RMRKYQMTGV EEVTQIPQEE HAANGPELLR KKRTTSAEKN TCQLYIQTDH LFFKYYGTRE 240
AVIAQISSHV KAIDTIYQTT DFSGIRNISF MVKRIRINTT ADEKDPTNPF RFPNIGVEKF 300
LELNSEQNHD DYCLAYVFTD RDFDDGVLGL AWVGAPSGSS GGICEKSKLY SDGKKKSLNT 360
GIITVQNYGS HVPPKVSHIT FAHEVGHNFG SPHDSGTECT PGESKNLGQK ENGNYIMYAR 420
ATSGDKLNNN KFSLCSIRNI SQVLEKKRNN CFVESGQPIC GNGMVEQGEE CDCGYSDQCK 480
DECCFDANQP EGRKCKLKPG KQCSPSQGPC CTAQCAFKSK SEKCRDDSDC AREGICNGFT 540
ALCPASDPKP NFTDCNRHTQ VCINGQCAGS ICEKYGLEEC TCASSDGKDD KELCHVCCMK 600
KMDPSTCAST GSVQWSRHFS GRTITLQPGS PCNDFRGYCD VFMRCRLVDA DGPIARLKKA 660
IFSPELYENI AEWIVAHWWA VLLMGIALIM LMAGFIKICS VHTPSSNPKL PPPKPLPGTL 720
KRRRPPQPIQ QPQRQRPRES YQMGHMRR

Seq ID NO: 115 DNA sequence

Nucleic Acid Accession #: NM_000577.2

Coding sequence: 41..520

1 11 21 31 41 51

| | | | | |

GGCACGAGGG GAAGACCTCC TGTCCTATCA GGCCCTCCCC ATGGCTTTAG AGACGATCTG 60
CCGACCCTCT GGGAGAAAAT CCAGCAAGAT GCAAGCCTTC AGAATCTGGG ATGTTAACCA 120
GAAGACCTTC TATCTGAGGA ACAACCAACT AGTTGCCGGA TACTTGCAAG GACCAAATGT 180
CAATTTAGAA GAAAAGATAG ATGTGGTACC CATTGAGCCT CATGCTCTGT TCTTGGGAAT 240
CCATGGAGGG AAGATGTGCC TGTCCTGTGT CAAGTCTGGT GATGAGACCA GACTCCAGCT 300
GGAGGCAGTT AACATCACTG ACCTGAGCGA GAACAGAAAG CAGGACAAGC GCTTCGCCTT 360
CATCCGCTCA GACAGTGGCC CCACCACCAG TTTTGAGTCT GCCGCCTGCC CCGGTTGGTT 420
CCTCTGCACA GCGATGGAAG CTGACCAGCC CGTCAGCCTC ACCAATATGC CTGACGAAGG 480
CGTCATGGTC ACCAAATTCT ACTTCCAGGA GGACGAGTAG TACTGCCCAG GCCTGCCTGT 540
TCCCATTCTT GCATGGCAAG GACTGCAGGG ACTGCCAGTC CCCCTGCCCC AGGGCTCCCG 600
GCTATGGGGG CACTGAGGAC CAGCCATTGA GGGGTGGACC CTCAGAAGGC GTCACAACAA 660
CCTGGTCACA GGACTCTGCC TCCTCTTCAA CTGACCAGCC TCCATGCTGC CTCCAGAATG 720
GTCTTTCTAA TGTGTGAATC AGAGCACAGC AGCCCCTGCA CAAAGCCCTT CCATGTCGCC 780
TCTGCATTCA GGATCAAACC CCGACCACCT GCCCAACCTG CTCTCCTCTT GCCACTGCCT 840
CTTCCTCCCT CATTCCACCT TCCCATGCCC TGGATCCATC AGGCCACTTG ATGACCCCCA 900
ACCAAGTGGC TCCCACACCC TGTTTTACAA AAAAGAAAAG ACCAGTCCAT GAGGGAGGTT 960
TTTAAGGGTT TGTGGAAAAT GAAAATTAGG ATTTCATGAT [illegible] CAGTCCCCGT 1020
GAAGGAGAGC CCTTCATTTG GAGATTATGT [illegible] AGAGGCTGAG GACTTAAAAT 1080
ATTCCTGCAT TTGTGAAATG ATGGTGAAAG TAAGTGGTAG CTTTTCCCTT CTTTTTCTTC 1140
TTTTTTTGTG ATGTCCCAAC TTGTAAAAAT TAAAAGTTAT GGTACTATGT TAGCCCCATA 1200
ATTTTTTTTT TCCTTTTAAA ACACTTCCAT AATCTGGACT CCTCTGTCCA GGCACTGCTG 1260
CCCAGCCTCC AAGCTCCATC TCCACTCCAG ATTTTTTACA GCTGCCTGCA GTACTTTACC 1320
TCCTATCAGA AGTTTCTCAG CTCCCAAGGC TCTGAGCAAA TGTGGCTCCT GGGGGTTCTT 1380
TCTTCCTCTG CTGAAGGAAT AAATTGCTCC TTGACATTGT AGAGCTTCTG GCACTTGGAG 1440
ACTTGTATGA AAGATGGCTG TGCCTCTGCC TGTCTCCCCC ACCAGGCTGG GAGCTCTGCA 1500
GAGCAGGAAA CATGACTCGT ATATGTCTCA GGTCCCTGCA GGGCCAAGCA CCTAGCCTCG 1560
CTCTTGGCAG GTACTCAGCG AATGAATGCT GTATATGTTG GGTGCAAAGT TCCCTACTTC 1620
CTGTGACTTC AGCTCTGTTT TACAATAAAA TCTTGAAAAT GCCTAAAAAA AAAAAAAAAA 1680
AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA



Seq ID NO: 116 Protein Sequence
Protein Accession #: NP_000568.1



1 11 21 31 41 51 

| | | | | |

MALETICRPS GRKSSKMQAF RIWDVNQKTF YLRNNQLVAG YLQGPNVNLE EKIDVVPIEP 60

HALFLGIHGG KMCLSCVKSG DETRLQLEAV NITDLSENRK QDKRFAFIRS DSGPTTSFES 120 

AACPGWFLCT AMEADQPVSL TNMPDEGVMV TKFYFQEDE 



Seq ID NO: 117 DNA sequence 

Nucleic Acid Accession #: CAT cluster



1 11 21 31 41 51 

GCACTTTAGC TCAGTGATGG ATTTTAAGAA CGATTACACC TGTCGCCTGT GGTCTGACTC 60 

CAGGCACTCG CGTCAGGTAC TTCTGCTCCA GGATAGCTTT ATGAATTGCT CTGACAGCAT 120 

CATCAATGGT TCCTTTCGTG CGCTTGGCTT TATTCATGAG GCTCAGGTCG GGGAAAGACT 180 

GATGGTCCAC TGTGACAGCA AGACAGGTAA TGCAAATACG GATTTCATCT GGGTGGGTCC 240 

AGATAACAGA CTGCTAGAGC CGGATAAAGA GATGGAAAAC TTTTACGTGT TTCACAATGG 300 

AAGTCTGGTT ATAGAAAGCC CTCGTTTTGA GGATGCTGGA GTGTATTCTT GTATCGCAAT 360

GAATAAGCAA CGCCTGTTAA ATGAAACTGT GGACGTCACA ATAAATGTGA GCAATTTCAC 420

TGTAAGCAGA TCCCATGCTC ATGAGGCATT TAACACAGCT TTTACCACTC TTGCTGCTTG 480

CGTGGCCAGT ATCGTTTTGG TACTTTTGTA CCTCTATCTG ACTCCATGCC CCTGCAAGTG 540

TAAAACCAAG AGACAGAAAA ATATGCTACA CCAAAGCAAT GCCCATTCAT CGATTCTCAG 600 

TCCTGGCCCC GCTAGTGATG CCTCCGCTGA TGAACGGAAG GCAGGTGCAG GTAAAAGAGT 660 

GGTGTTTTTG GAACCCCTGA AGGATACTGC AGCAGGGCAG AACGGGAAAG TCAGGCTCTT 720 

TCCCAGCGAG GCAGTGATAG CTGAGGGCAT CCTAAAGTCC ACGAGGGGGA AATCTGACTC 780 
AGATTCAGTC AATTCAGTGT TTTCTGACAC ACCTTTTGTG GCGTCCACTT AA 



Seq ID NO: 118 DNA sequence
Nucleic Acid Accession #: AB032989.1
Coding sequence: 1..1316

1 11 21 31 41 51

| | | | | |

CCCATTCCTT GCCCAGTTAC ACAGCACTAC TGGACCTCAG TCACAACAAC CTGAGCCGCC 60

TGCGGGCCGA GTGGACCCCC ACGCGCCTGA CCCAACTGCA CTCCCTGCTG CTGAGCCACA 120 

ACCACCTGAA CTTCATCTCC TCTGAGGCCT TTTCCCCGGT ACCCAACCTG CGCTACCTGG 180

ACCTCTCCTC CAACCAGCTG CGTACACTGG ATGAGTTCCT GTTCAGTGAC CTGCAAGTAC 240 

TGGAGGTGCT GCTGCTCTAC AATAACCACA TCATGGCGGT GGACCGGTGC GCCTTCGATG 300 

ACATGGCCCA GCTGCAGAAA CTCTACTTGA GCCAGAACCA GATCTCTCGC TTCCCTCTGG 360 

AACTGGTCAA GGAAGGAGCC AAGCTACCCA AACTAACGCT CCTGGATCTC TCTTCTAACA 420 

AGCTGAAGAA CTTGCCATTG CCTGACCTGC AGAAGCTGCC GGCCTGGATC AAGAATGGGC 480 

TGTACCTACA TAACAACCCC CTGAACTGCG ACTGTGAGCT CTACCAGCTG TTTTCACACT 540 

GGCAGTATCG GCAGCTGAGC TCCGTGATGG ACTTTCAAGA GGATCTGTAC TGCATGAACT 600 

CCAAGAAGCT GCACAATGTC TTCAACCTGA GTTTCCTCAA CTGTGGCGAG TACAAGGAGC 660 

GTGCCTGGGA GGCCCACCTG GGTGACACCT TGATCATCAA GTGTGACACC AAGCAGCAAG 720 

GGATGACCAA GGTGTGGGTG ACACCAAGTA ATGAACGGGT GCTAGATGAG GTGACCAATG 780 

GCACAGTGAG TGTGTCTAAG GATGGCAGTC TTCTTTTCCA GCAGGTGCAG GTCGAGGACG 840 

GTGGTGTGTA TACCTGCTAT GCCATGGGAG AGACTTTCAA TGAGACACTG TCTGTGGAAT 900 

TGAAAGTGCA CAATTTCACC TTGCACGGAC ACCATGACAC CCTCAACACA GCCTATACCA 960 

CCCTAGTGGG CTGTATCCTT AGTGTGGTCC TGGTCCTCAT ATACCTATAC CTCACCCCTT 1020 

GCCGCTGCTG GTGCCGGGGT GTAGAGAAGC CTTCCAGCCA TCAAGGAGAC AGCCTCAGCT 1080 

CTTCCATGCT TAGTACCACA CCCAACCATG ATCCTATGGC TGGTGGGGAC AAAGATGATG 1140 

GTTTTGACCG GCGGGTGGCT TTCCTGGAAC CTGCTGGACC TGGGCAGGGT CAAAACGGCA 1200 

AGCTCAAGCC AGGCAACACC CTGCCAGTGC CTGAGGCCAC AGGCAAGGGC CAACGGAGGA 1260 

TGTCGGATCC AGAATCAGTC AGCTCGGTCT TCTCTGATAC GCCCATTGTG GTGTGAGCAG 1320 

GATGGGTTGG TGGGGAGATT CTGCCCCAGG AGAGGTAATG CACCCCTGAA GGATATGAGG 1380 

GGATGGAAGA GAGGGCTGGC TGCCCAAGGG AATGGATTCC TCCTGACCTC AAGGGAATTG 1440 

GTCCCAGGTA CAACAGAGAG CAAGACCCCA AACAGGGCGT GGCTGCCACG ATTTTCAAAT 1500 

GGGGGTATAT TAATCCTCAG GCAAATGCTA CACCCGTACC CAAAGCCCTG CCAATTCTCA 1560 

GTGTGGTAGA GGAAGAGAAG CATGTCTGAG TTGGATGGGA ATGAGGAAGG AGTGAGGGGA 1620 

AGAACAGATT CCCTAAACTT TCATGAGGTC ATCCCTAGCT CCTTAAGAGA AAAACATTTA 1680 

GAAAAAAAAA TTTTTTCTAT CTCTGCCCAC CCACACCTGT GATGTTGTGT GTGTTGGGGG 1740 

AGCTTCCACA GGAGCCACAC TGGGGAAAAG CTGTAGTATC TTCTTTGTTT AGCAAGTCAA 1800 

ACTTCACAGC TGGGGAAGTT GAAAACCTTT GGAAAATGAG TCAGGAAAAG GCTGAGACCT 1860

CAATCGGTAA CAATTCCCCA AAGCTGTTGT AAGACTTTGC ATTAAAGCCT TTCTTTTTCC 1920 

TGATGTGCCC TGGGTGGATG GATCTTTGCA GGAGCCTGGC CTGCTGCCTT TTAGTTACAA 1980 

CAGCAATGTG GGATGTGGTA TCTGTGCCTT GTCCTGGGAC CAATGGCACA GAGGGTTATA 2040 

GCATGAACTG AAGGGTCCAT GTCACAGCAT AGGTGCCAGC ACGAAACTTG TAGATGGGGG 2100 

GTGAATTCCA GATAGGCTAA TTCCCTGGTT GGTGAAACCC TTGGTTAGTA TTTGATTCCC 2160 

AGCATCTGTG TGTCAGGCCA GAGTGGGGGC TGCTGAAGGC AGAGCTTTTA TGGATGCCTG 2220 

CCACAGACGT GGTACATGGG CCCACTTCAC GATAGAAGGA TGAAGGTAAA TCAGCCTCTA 2280 

TTATGTAGCA TCCAGCTGGA CCTAGCTGAT TCTCAATGTA TGCCATCTCT TCATCCAGTC 2340 

TCCTTATCAA GCTCAGGTGG GATTCTGAAT TTTCCCTAAT ATGCTGCTCT GTAAAGGGAG 2400 

GAGTCCAAGA AAGCCTGGCC CCAGCTGGCT TAGAGTAGGC TTGGCGATAG TGAAAATCCA 2460 

GTATATATCT CCTCTGTGTG GACAAGGTTG CCTTGCTCCT CTCAGAGATA CCACCTCTGC 2520 

TGCAAAGTCA AAGCATGGGG TTTTGACATC AGAAGGGTTG GAATTGGGTG GTCTCTACCC 2580 

CAGAATGGCA ATTTGGAGTG CAGCTTCAAG TTCTGCCCTC TCCACAGATT CCAGGCAATC 2640 

CTGAAGCAGA GATAATCAGT AGTACTAAAC TTTCAGAAGG GGAGTTCTGC CACCCCATTC 2700 

CTCCATCGAG AGGTCTTTGC AGTTGTGGTT GTGAGGCCAG AGGAACTTAA GAAGAATGAG 2760
CTCAACATTC CATTCCATCC AAGGTTGCAG AATACTTTGA GTATGGAGAT GACCGATCCC 2820
TTTAACCTTT CCCTTTGACT ATACTGCATG CCCTGTGAGT GGCTGCTCTG AGCTTTGGCG 2880
GGGGATGAGC AGGGTGAAGG TGAAAATGTT CTCTGATGTT GGTTTTTCTT AGTGTCTCTT 2940
CTCTCCCACG TTTCTAGTCC ATTGGAAGTT CTCCAGATCC TCTAGATTTC TGACCTCTTT 3000
TTACATTCTA AGAGGTGTCA TTTACCTGTG GGTGAGGGGC TTGGAGGGCC TGACTATTAA 3060
GCTTCCTATT ATGATTGGGG AAGGGGACTC AACATCTTCC CCTTGTCTGG TCCTGGGAAC 3120
TCAAGACTGC TGTCCATCCT TTATTAGTAA CCTAGTTTTG TTGAGAGAGA GGTGCAGCAT 3180
TTCCCTCAAG TACCTGCATG CTCATAGGTA TGCATGCTTA TAGGTATGCA TGCACACCAG 3240
CACCTTGACT TCCACCAGGA CAGTGAATCT GTAGTCCCCA TTAATGATAC TCTCAGAATG 3300
GTCTCCACCA ATGTGCACAA CTTCATGCAC ACAATTGTAT TCCTCTTTAG GAGAGACAGT 3360
GACCATGCCA GGCTCTTCAA ATACACACTC CTGTATTTTC ATTTCTGAAC TCCCTGAAGC 3420
TTCACCCACA TCTCCATGGC ACTGAATCAC TTTTCTGCCC AGGCTGGAGT TAGAAGAACC 3480
ATGGGTTGAA GGCCTTGGGA GAAGAGGTGG TAAGGAGCTA CCTGGTTCTT TGGACTCCTC 3540
AGAAACCTCC CGTGAAGACT ATATATATGA GCAATATATG AGCAATTAAG ATAGAACTAG 3600
GTCATTGAAG GACTCAGAAG GAAACTGGCA AGGGGCTAAG TAGCTGTGAG CTGCTCTTCC 3660
TGTTAACTGT CCTCAGGAGA GAAAAGTTGT CTTGATTTAT TCCAGAGACT CCTGAGAATT 3720
TTGTTACTGG AGCTATAGGA ACTAGGACTC CCTCCTCAGA CTCATCAGGA AATCTGGGAA 3780
AATCTCATCT CCATGCTGCA GTCTCCCCGC TTGGCACCAT CTAGGGCTCT GAAGATCTTT 3840
AAAGTGAGCG TTAAATGAAG CAGGCCTTCT GGGGCATCGT GATGGAGCTC CTGAGGACTG 3900
GGAGGGCCGT TTTCTTCATA ATTCCTACCC AGTTTAGGGA ATGAGAAAAG GCCCTGTGGC 3960
TCTTAAGTCT CCTGACTATA TCCACTAGTT TGGCCTTGAG TTTGAGACAC CTGGTAGATG 4020
TACCTGGCTG CCCTACCAAG AGATTTGATT TGTTTCTTCC CGTGGTTTCA GGTGATAGAA 4080
CCCAAAGGTA GCTTTTTAAT TGGAGAATTG AATAATCAGC ATATGTAAGC GCACTAGAAC 4140
CCTGTGTTGA AAACTGCCAG GTGTGGGTAT AAGAAAAGGC CGAGAGATCA CCACCTCCTC 4200
TTACCCTACC CCACACCCAT AAACCAGACA TGTCTCCCAG GAAGCAGGTG TCCCTGGACA 4260
CAGAGGATGA CAGGGCTCTA CAATCTGTGT AATTATTATT ATTATTATTA TTTTGTATTT 4320
ATGGGGCCCA AGAAAGGGGC CAGGAGAGGG TACACCCCAG CTGGGGAGAG CAAAGCAGAT 4380
GGATCCAGTT TTCTGTGTGT TCTTACCTCT GTACTTCCTC GTAGCTCTGC TGACAAAGCA 4440
AGCAGGCCTC CCGTGTCCAA GACCCCATTC CTCCCACTTG TGTACACCTA GGCTGGCAAA 4500
TCTTGGAGCC TCTGGGCTCT GAAAACTAGA CAATGATCAT TAAACCTGGC TTGAGTCTCT 4560
GTTCTGGC



Seq ID NO: 119 Protein Sequence 
Protein Accession #: BAA86477.1 



1 11 21 31 41 51

| | | | | |

HSLPSYTALL DLSHNNLSRL RAEWTPTRLT QLHSLLLSHN HLNFISSEAF SPVPNLRYLD 60
LSSNQLRTLD EFLFSDLQVL EVLLLYNNHI MAVDRCAFDD MAQLQKLYLS QNQISRFPLE 120
LVKEGAKLPK LTLLDLSSNK LKNLPLPDLQ KLPAWIKNGL YLHNNPLNCD CELYQLFSHW 180
QYRQLSSVMD FQEDLYCMNS KKLHNVFNLS FLNCGEYKER AWEAHLGDTL IIKCDTKQQG 240
MTKVWVTPSN ERVLDEVTNG TVSVSKDGSL LFQQVQVEDG GVYTCYAMGE TFNETLSVEL 300
KVHNFTLHGH HDTLNTAYTT LVGCILSVVL VLIYLYLTPC RCWCRGVEKP SSHQGDSLSS 360
SMLSTTPNHD PMAGGDKDDG FDRRVAFLEP AGPGQGQNGK LKPGNTLPVP EATGKGQRRM 420
SDPESVSSVF SDTPIVV



Seq ID NO: 120 DNA sequence
Nucleic Acid Accession #: NM_005071
Coding sequence: 9..1703



1 11 21 31 41 51

| | | | | |

GATAGACCAT GAGCAGCCAT GGCAACAGCC TGTTCCTTCG GGAGAGCGGC CAGCGGCTGG 60
GCCGGGTGGG CTGGCTGCAG CGGCTGCAGG AAAGCCTGCA GCAGAGAGCA CTGCGCACGC 120
GCCTGCGCCT GCAGACCATG ACCCTCGAGC ACGTGCTGCG CTTCCTGCGC CGAAACGCCT 180
TCATTCTGCT GACGGTCAGC GCCGTGGTCA TTGGGGTCAG CCTGGCCTTT GCCCTGCGCC 240
CATATCAGCT CACCTACCGC CAGATCAAGT ACTTCTCTTT TCCTGGAGAG CTTCTGATGA 300
GGATGCTGCA GATGCTGGTG TTACCTCTCA TTGTCTCCAG CCTGGTCACA GGTATGGCAT 360
CCCTGGACAA CAAGGCCACG GGGCGGATGG GGATGCGGGC AGCTGTGTAC TACATGGTGA 420
CCACCATCAT CGCGGTCTTC ATCGGCATCC TCATGGTCAC CATCATCCAT CCCGGGAAGG 480
GCTCCAAGGA GGGGCTGCAC CGGGAGGGCC GGATCGAGAC CATCCCCACA GCTGATGCCT 540
TCATGGACCT GATCAGAAAT ATGTTTCCAC CAAACCTTGT GGAGGCCTGC TTCAAACAGT 600
TCAAGACGCA GTACAGCACG AGGGTGGTAA CCAGGACCAT GGTGAGGACA GAGAACGGGT 660
CTGAGCCGGG TGCCTCCATG CCTCCTCCAT TCTCAGTGGA GAACGGAACC AGCTTCCTGG 720
AAAATGTCAC TCGGGCCTTG GGTACCCTGC AGGAGATGCT GAGCTTTGAG GAGACTGTAC 780
CCGTGCCTGG CTCCGCCAAT GGCATCAACG CCCTGGGCCT CGTGGTCTTC TCTGTGGCCT 840
TTGGGCTGGT CATTGGTGGC ATGAAACACA AGGGCAGAGT CCTCAGGGAC TTCTTCGACA 900
GCCTCAATGA GGCTATTATG AGGCTGGTGG GCATCATTAT CTGGTATGCA CCTGTGGGCA 960
TCCTGTTCCT GATTGCTGGG AAGATTCTGG AGATGGAAGA CATGGCCGTC CTGGGGGGTC 1020
AGCTGGGCAT GTACACCCTG ACCGTCATCG TGGGCCTGTT CCTCCATGCC GGCATTGTCC 1080
TTCCCCTCAT CTACTTCCTC GTCACTCACC GGAACCCCTT CCCCTTCATT GGGGGCATGC 1140
TACAAGCCCT CATCACCGCT ATGGGCACGT CTTCCAGCTC GGCAACGCTG CCCATCACCT 1200
TCCGCTGCCT GGAGGAGGGC CTGGGTGTGG ACCGCCGCAT CACCAGGTTC GTCCTGCCCG 1260
TGGGCGCCAC GGTCAACATG GATGGCACTG CCCTCTACGA GGCCCTGGCT GCCATCTTCA 1320
TTGCTCAAGT TAACAACTAC GAGCTCAACC TGGGTCAGAT CACAACCATC AGCATCACGG 1380
CCACAGCAGC CAGTGTTGGG GCTGCTGGCA TCCCCCAGGC GGGTCTGGTC ACCATGGTCA 1440
TTGTGCTTAC GTCGGTCGGC TTGCCCACGG AAGACATCAC GCTCATCATC GCCGTGGACT 1500
GGTTCCTTGA CCGGCTTCGC ACAATGACCA ACGTACTGGG GGACTCAATT GGAGCGGCCG 1560
TCATCGAGCA CTTGTCTCAG CGGGAGCTGG AGCTTCAGGA AGCTGAGCTT ACCCTCCCCA 1620
GCCTGGGGAA ACCCTACAAG TCCCTCATGG CACAGGAGAA GGGGGCATCC CGGGGACGGG 1680
GAGGCAACGA GAGTGCTATG TGAGGGGCCT CCAGCTCTG

Seq ID NO: 121 Protein Sequence
Protein Accession #: NP_005062



1 11 21 31 41 51

| | | | | |

MSSHGNSLFL RESGQRLGRV GWLQRLQESL QQRALRTRLR LQTMTLEHVL RFLRRNAFIL 60
LTVSAVVIGV SLAFALRPYQ LTYRQIKYFS FPGELLMRML QMLVLPLIVS SLVTGMASLD 120
NKATGRMGMR AAVYYMVTTI IAVFIGILMV TIIHPGKGSK EGLHREGRIE TIPTADAFMD 180
LIRNMFPPNL VEACFKQFKT QYSTRVVTRT MVRTENGSEP GASMPPPFSV ENGTSFLENV 240
TRALGTLQEM LSFEETVPVP GSANGINALG LVVFSVAFGL VIGGMKHKGR VLRDFFDSLN 300
EAIMRLVGII IWYAPVGILF LIAGKILEME DMAVLGGQLG MYTLTVIVGL FLHAGIVLPL 360
IYFLVTHRNP FPFIGGMLQA LITAMGTSSS SATLPITFRC LEEGLGVDRR ITRFVLPVGA 420
TVNMDGTALY EALAAIFIAQ VNNYELNLGQ ITTISITATA ASVGAAGIPQ AGLVTMVIVL 480
TSVGLPTEDI TLIIAVDWFL DRLRTMTNVL GDSIGAAVIE HLSQRELELQ EAELTLPSLG 540
KPYKSLMAQE KGASRGRGGN ESAM



Seq ID NO: 122 DNA sequence

Nucleic Acid Accession #: NM_031966.1

Coding sequence: 178..1479



1 11 21 31 41 51

| | | | | |

ACGAACAGGC CAATAAGGAG GGAGCAGTGC GGGGTTTAAA TCTGAGGCTA GGCTGGCTCT 60
TCTCGGCGTG CTGCGGCGGA ACGGCTGTTG GTTTCTGCTG GTTGTAGGTC CTTGGCTGGT 120
CGGGCCTCCG GTGTTCTGCT TCTCCCCGCT GAGCTGCTGC CTGGTGAAGA GGAAGCCATG 180
GCGCTCCGAG TCACCAGGAA CTCGAAAATT AATGCTGAAA ATAAGGCGAA GATCAACATG 240
GCAGGCGCAA AGCGCGTTCC TACGGCCCCT GCTGCAACCT CCAAGCCCGG ACTGAGGCCA 300
AGAACAGCTC TTGGGGACAT TGGTAACAAA GTCAGTGAAC AACTGCAGGC CAAAATGCCT 360
ATGAAGAAGG AAGCAAAACC TTCAGCTACT GGAAAAGTCA TTGATAAAAA ACTACCAAAA 420
CCTCTTGAAA AGGTACCTAT GCTGGTGCCA GTGCCAGTGT CTGAGCCAGT GCCAGAGCCA 480
GAACCTGAGC CAGAACCTGA GCCTGTTAAA GAAGAAAAAC TTTCGCCTGA GCCTATTTTG 540
GTTGATACTG CCTCTCCAAG CCCAATGGAA ACATCTGGAT GTGCCCCTGC AGAAGAAGAC 600
CTGTGTCAGG CTTTCTCTGA TGTAATTCTT GCAGTAAATG ATGTGGATGC AGAAGATGGA 660
GCTGATCCAA ACCTTTGTAG TGAATATGTG AAAGATATTT ATGCTTATCT GAGACAACTT 720
GAGGAAGAGC AAGCAGTCAG ACCAAAATAC CTACTGGGTC GGGAAGTCAC TGGAAACATG 780
AGAGCCATCC TAATTGACTG GCTAGTACAG GTTCAAATGA AATTCAGGTT GTTGCAGGAG 840
ACCATGTACA TGACTGTCTC CATTATTGAT CGGTTCATGC AGAATAATTG TGTGCCCAAG 900
AAGATGCTGC AGCTGGTTGG TGTCACTGCC ATGTTTATTG CAAGCAAATA TGAAGAAATG 960
TACCCTCCAG AAATTGGTGA CTTTGCTTTT GTGACTGACA ACACTTATAC TAAGCACCAA 1020
ATCAGACAGA TGGAAATGAA GATTCTAAGA GCTTTAAACT TTGGTCTGGG TCGGCCTCTA 1080
CCTTTGCACT TCCTTCGGAG AGCATCTAAG ATTGGAGAGG TTGATGTCGA GCAACATACT 1140
TTGGCCAAAT ACCTGATGGA ACTAACTATG TTGGACTATG ACATGGTGCA CTTTCCTCCT 1200
TCTCAAATTG CAGCAGGAGC TTTTTGCTTA GCACTGAAAA TTCTGGATAA TGGTGAATGG 1260
ACACCAACTC TACAACATTA CCTGTCATAT ACTGAAGAAT CTCTTCTTCC AGTTATGCAG 1320
CACCTGGCTA AGAATGTAGT CATGGTAAAT CAAGGACTTA CAAAGCACAT GACTGTCAAG 1380
AACAAGTATG CCACATCGAA GCATGCTAAG ATCAGCACTC TACCACAGCT GAATTCTGCA 1440
CTAGTTCAAG ATTTAGCCAA GGCTGTGGCA AAGGTGTAAC TTGTAAACTT GAGTTGGAGT 1500
ACTATATTTA CAAATAAAAT TGGCACCATG TGCCATCTGT AAAAAAAAAA AAAAAAAAAA 1560
AAAAAAAAAA AAAAAAAA



Seq ID NO: 123 Protein Sequence
Protein Accession #: NP_114172.1

1 11 21 31 41 51

| | | | | |

MALRVTRNSK INAENKAKIN MAGAKRVPTA PAATSKPGLR PRTALGDIGN KVSEQLQAKM 60
PMKKEAKPSA TGKVIDKKLP KPLEKVPMLV PVPVSEPVPE PEPEPEPEPV KEEKLSPEPI 120
LVDTASPSPM ETSGCAPAEE DLCQAFSDVI LAVNDVDAED GADPNLCSEY VKDIYAYLRQ 180
LEEEQAVRPK YLLGREVTGN MRAILIDWLV QVQMKFRLLQ ETMYMTVSII DRFMQNNCVP 240
KKMLQLVGVT AMFIASKYEE MYPPEIGDFA FVTDNTYTKH QIRQMEMKIL RALNFGLGRP 300
LPLHFLRRAS KIGEVDVEQH TLAKYLMELT MLDYDMVHFP PSQIAAGAFC LALKILDNGE 360
WTPTLQHYLS YTEESLLPVM QHLAKNVVMV NQGLTKHMTV KNKYATSKHA KISTLPQLNS 420
ALVQDLAKAV AKV






Seq ID NO: 124 DNA sequence

Nucleic Acid Accession #: Eos sequence

Coding sequence: 13..1424

1 11 21 31 41 51

| | | | | |

TAGAAGTTTA CAATGAAGTT TCTTCTAATA CTGCTCCTGC AGGCCACTGC TTCTGGAGCT 60
CTTCCCCTGA ACAGCTCTAC AAGCCTGGAA AAAAATAATG TGCTATTTGG TGAAAGATAC 120
TTAGAAAAAT TTTATGGCCT TGAGATAAAC AAACTTCCAG TGACAAAAAT GAAATATAGT 180
GGAAACTTAA TGAAGGAAAA AATCCAAGAA ATGCAGCACT TCTTGGGTCT GAAAGTGACC 240
GGGCAACTGG ACACATCTAC CCTGGAGATG ATGCAOGCAC CTCGATGTGG AGTCCCCGAT 300
GTCCATCATT TCAGGGAAAT GCCAGGGGGG CCCGTATGGA GGAAACATTA TATCACCTAC 360
AGAATCAATA ATTACACACC TGACATGAAC CGTGAGGATG TTGACTACGC AATCCGGAAA 420
GCTTTCCAAG TATGGAGTAA TGTTACCCCC TTGAAATTCA GCAAGATTAA CACAGGCATG 480

GCTGACATTT TGGTGGTTTT TGCCCGTGGA GCTCATGGAG ACTTCCATGC TTTTGATGGC 540 
AAAGGTGGAA TCCTAGCCCA TGCTTTTGGA CCTGGATCTG GCATTGGAGG GGATGCACAT 600 
TTCGATGAGG ACGAATTCTG GACTACACAT TCAGGAGGCA CAAACTTGTT CCTCACTGCT 660
GTTCACGAGA TTGGCCATTC CTTAGGTCTT GGCCATTCTA GTGATCCAAA GGCCGTAATG 720
TTCCCCACCT ACAAATATGT TGACATCAAC ACATTTCGCC TCTCTGCTGA TGACATACGT 780 
GGCATTCAGT CCCTGTATGG AGACCCAAAA GAGAACCAAC GCTTGCCAAA TCCTGACAAT 840 
TCAGAACCAG CTCTCTGTGA CCCCAATTTG AGTTTTGATG CTGTCACTAC CGTGGGAAAT 900 
AAGATCTTTT TCTTCAAAGA CAGGTTCTTC TGGCTGAAGG TTTCTGAGAG ACCAAAGACC 960 

AGTGTTAATT TAATTTCTTC CTTATGGCCA ACCTTGCCAT CTGGCATTGA AGCTGCTTAT 1020

GAAATTGAAG CCAGAAATCA AGTTTTTCTT TTTAAAGATG ACAAATACTG GTTAATTAGC 1080 

AATTTAAGAC CAGAGCCAAA TTATCCCAAG AGCATACATT CTTTTGGTTT TCCTAACTTT 1140 

GTGAAAAAAA TTGATGCAGC TGTTTTTAAC CCACGTTTTT ATAGGACCTA CTTCTTTGTA 1200 

GATAACCAGT ATTGGAGGTA TGATGAAAGG AGACAGATGA TGGACCCTGG TTATCCCAAA 1260 

CTGATTACCA AGAACTTCCA AGGAATCGGG CCTAAAATTG ATGCAGTCTT CTACTCTAAA 1320 

AACAAATACT ACTATTTCTT CCAAGGATCT AACCAATTTG AATATGACTT CCTACTCCAA 1380 

CGTATCACCA AAACACTGAA AAGCAATAGC TGGTTTGGTT GTTGAAAATG GTGTAATTAA 1440 

TGGTTTTTGT TAGTTCACTT CAGCTTAATA AGTATTTATT GCATATTTGC TATGTCCTCA 1500 

GTGTACCACT ACTTAGAGAT ATGTATCATA AAAATAAAAT CTGTAAACCA TAGGTAATGA 1560 

TTATATAAAA TACATAATAT TTTTCAATTT TGAAAACTCT AATTGTCCAT TCTTGCTTGA 1620

CTCTACTATT AAGTTTGAAA ATAGTTACCT TCAAAGCAAG ATAATTCTAT TTGAAGCATG 1680 

CTCTGTAAGT TGCTTCCTAA CATCCTTGGA CTGAGAAATT ATACTTACTT CTGGCATAAC 1740 
TAAAATTAAG TATATATATT TTGGCTCAAA TAAAATTG 

Seq ID NO: 125 Protein sequence
Protein Accession #: Eos sequence

1 11 21 31 41 51

| | | | | |

MKFLLILLLQ ATASGALPLN SSTSLEKNNV LFGERYLEKF YGLEINKLPV TKMKYSGNLM 60
KEKIQEMQHF LGLKVTGQLD TSTLEMMHAP RCGVPDVHHF REMPGGPVWR KHYITYRINN 120
YTPDMNREDV DYAIRKAFQV WSNVTPLKFS KINTGMADIL VVFARGAHGD FHAFDGKGGI 180
LAHAFGPGSG IGGDAHFDED EFWTTHSGGT NLFLTAVHEI GHSLGLGHSS DPKAVMFPTY 240
KYVDINTFRL SADDIRGIQS LYGDPKENQR LPNPDNSEPA LCDPNLSFDA VTTVGNKIFF 300
FKDRFFWLKV SERPKTSVNL ISSLWPTLPS GIEAAYEIEA RNQVFLFKDD KYWLISNLRP 360
EPNYPKSIHS FGFPNFVKKI DAAVFNPRFY RTYFFVDNQY WRYDERRQMM DPGYPKLITK 420
NFQGIGPKID AVFYSKNKYY YFFQGSNQFE YDFLLQRITK TLKSNSWFGC

Seq ID NO: 126 DNA sequence 
Nucleic Acid Accession #: NM_000077.2
Coding sequence: 272..742

1 11 21 31 41 51

| | | | | |

CCCAACCTGG GGCGACTTCA GGTGTGCCAC ATTCGCTAAG TGCTCGGAGT TAATAGCACC 60

TCCTCCGAGC ACTCGCTCAC GGCGTCCCCT TGCCTGGAAA GATACCGCGG TCCCTCCAGA 120 

GGATTTGAGG GACAGGGTCG GAGGGGGCTC TTCCGCCAGC ACCGGAGGAA GAAAGAGGAG 180 

GGGCTGGCTG GTCACCAGAG GGTGGGGCGG ACCGCGTGCG CTCGGCGGCT GCGGAGAGGG 240 

GGAGAGCAGG CAGCGGGCGG CGGGGAGCAG CATGGAGCCG GCGGCGGGGA GCAGCATGGA 300 

GCCTTCGGCT GACTGGCTGG CCACGGCCGC GGCCCGGGGT CGGGTAGAGG AGGTGCGGGC 360

GCTGCTGGAG GCGGGGGCGC TGCCCAACGC ACCGAATAGT TACGGTCGGA GGCCGATCCA 420 

GGTCATGATG ATGGGCAGCG CCCGAGTGGC GGAGCTGCTG CTGCTCCACG GCGCGGAGCC 480 

CAACTGCGCC GACCCCGCCA CTCTCACCCG ACCCGTGCAC GACGCTGCCC GGGAGGGCTT 540 

CCTGGACACG CTGGTGGTGC TGCACCGGGC CGGGGCGCGG CTGGACGTGC GCGATGCCTG 600 

GGGCCGTCTG CCCGTGGACC TGGCTGAGGA GCTGGGCCAT CGCGATGTCG CACGGTACCT 660

GCGCGCGGCT GCGGGGGGCA CCAGAGGCAG TAACCATGCC CGCATAGATG CCGCGGAAGG 720 

TCCCTCAGAC ATCCCCGATT GAAAGAACCA GAGAGGCTCT GAGAAACCTC GGGAAACTTA 780 

GATCATCAGT CACCGAAGGT CCTACAGGGC CACAACTGCC CCCGCCACAA CCCACCCCGC 840 

TTTCGTAGTT TTCATTTAGA AAATAGAGCT TTTAAAAATG TCCTGCCTTT TAACGTAGAT 900

ATATGCCTTC CCCCACTACC GTAAATGTCC ATTTATATCA TTTTTTATAT ATTCTTATAA 960

AAATGTAAAA AAGAAAAACA CCGCTTCTGC CTTTTCACTG TGTTGGAGTT TTCTGGAGTG 1020 

AGCACTCACG CCCTAAGCGC ACATTCATGT GGGCATTTCT TGCGAGCCTC GCAGCCTCCG 1080 

GAAGCTGTCG ACTTCATGAC AAGCATTTTG TGAACTAGGG AAGCTCAGGG GGGTTACTGG 1140 

CTTCTCTTGA GTCACACTGC TAGCAAATGG CAGAACCAAA GCTCAAATAA AAATAAAATA 1200 
ATTTTCATTC ATTCACTC

Seq ID NO: 127 Protein Sequence 
Protein Accession #: NP_000068.1

1 11 21 31 41 51

| | | | | |

MEPAAGSSME PSADWLATAA ARGRVEEVRA LLEAGALPNA PNSYGRRPIQ VMMMGSARVA 60
ELLLLHGAEP NCADPATLTR PVHDAAREGF LDTLVVLHRA GARLDVRDAW GRLPVDLAEE 120
LGHRDVARYL RAAAGGTRGS NHARIDAAEG PSDIPD

Seq ID NO: 128 DNA sequence

Nucleic Acid Accession #: NM_058196.1

Coding sequence: 104..421



1 11 21 31 41 51

| | | | | |

TGTGTGGGGG TCTGCTTGGC GGTGAGGGGG CTCTACACAA GCTTCCTTTC CGTCATGCCG 60
GCCCCCACCC TGGCTCTGAC CATTCTGTTC TCTCTGGCAG GTCATGATGA TGGGCAGCGC 120
CCGAGTGGCG GAGCTGCTGC TGCTCCACGG CGCGGAGCCC AACTGCGCCG ACCCCGCCAC 180
TCTCACCCGA CCCGTGCACG ACGCTGCCCG GGAGGGCTTC CTGGACACGC TGGTGGTGCT 240
GCACCGGGCC GGGGCGCGGC TGGACGTGCG CGATGCCTGG GGCCGTCTGC CCGTGGACCT 300
GGCTGAGGAG CTGGGCCATC GCGATGTCGC ACGGTACCTG CGCGCGGCTG CGGGGGGCAC 360
CAGAGGCAGT AACCATGCCC GCATAGATGC CGCGGAAGGT CCCTCAGACA TCCCCGATTG 420
AAAGAACCAG AGAGGCTCTG AGAAACCTCG GGAAACTTAG ATCATCAGTC ACCGAAGGTC 480
CTACAGGGCC ACAACTGCCC CCGCCACAAC CCACCCCGCT TTCGTAGTTT TCATTTAGAA 540
AATAGAGCTT TTAAAAATGT CCTGCCTTTT AACGTAGATA TAAGCCTTCC CCCACTACCG 600
TAAATGTCCA TTTATATCAT TTTTTATATA TTCTTATAAA AATGTAAAAA AGAAAAACAC 660
CGCTTCTGCC TTTTCACTGT GTTGGAGTTT TCTGGAGTGA GCACTCACGC CCTAAGCGCA 720
CATTCATGTG GGCATTTCTT GCGAGCCTCG CAGCCTCCGG AAGCTGTCGA CTTCATGACA 780
AGCATTTTGT GAACTAGGGA AGCTCAGGGG GGTTACTGGC TTCTCTTGAG TCACACTGCT 840
AGCAAATGGC AGAACCAAAG CTCAAATAAA AATAAAATAA TTTTCATTCA TTCACTC




Seq ID NO: 129 Protein Sequence
Protein Accession #: NP_478103.1

1 11 21 31 41 51

| | | | | |

MMMGSARVAE LLLLHGAEPN CADPATLTRP VHDAAREGFL DTLVVLHRAG ARLDVRDAWG 60
RLPVDLAEEL GHRDVARYLR AAAGGTRGSN HARIDAAEGP SDIPD



Seq ID NO: 130 DNA sequence 
Nucleic Acid Accession #: NM_058197.1
Coding sequence: 272..646

1 11 21 31 41 51

| | | | | |

CCCAACCTGG GGCGACTTCA GGTGTGCCAC ATTCGCTAAG TGCTCGGAGT TAATAGCACC 60

TCCTCCGAGC ACTCGCTCAC GGCGTCCCCT TGCCTGGAAA GATACCGCGG TCCCTCCAGA 120

GGATTTGAGG GACAGGGTCG GAGGGGGCTC TTCCGCCAGC ACCGGAGGAA GAAAGAGGAG 180 

GGGCTGGCTG GTCACCAGAG GGTGGGGCGG ACCGCGTGCG CTCGGCGGCT GCGGAGAGGG 240 

GGAGAGCAGG CAGCGGGCGG CGGGGAGCAG CATGGAGCCG GCGGCGGGGA GCAGCATGGA 300 

GCCGGCGGCG GGGAGCAGCA TGGAGCCTTC GGCTGACTGG CTGGCCACGG CCGCGGCCCG 360 

GGGTCGGGTA GAGGAGGTGC GGGCGCTGCT GGAGGCGGGG GCGCTGCCCA ACGCACCGAA 420

TAGTTACGGT CGGAGGCCGA TCCAGGTGGG TAGAAGGTCT GCAGCGGGAG CAGGGGATGG 480 

CGGGCGACTC TGGAGGACGA AGTTTGCAGG GGAATTGGAA TCAGGTAGCG CTTCGATTCT 540 

CCGGAAAAAG GGGAGGCTTC CTGGGGAGTT TTCAGAAGGG GTTTGTAATC ACAGACCTCC 600 

TCCTGGCGAC GCCCTGGGGG CTTGGGAAAC CAAGGAAGAG GAATGAGGAG CCACGCGCGT 660 

ACAGATCTCT CGAATGCTGA GAAGATCTGA AGGGGGGAAC ATATTTGTAT TAGATGGAAG 720

TCATGATGAT GGGCAGCGCC CGAGTGGCGG AGCTGCTGCT GCTCCACGGC GCGGAGCCCA 780 

ACTGCGCCGA CCCCGCCACT CTCACCCGAC CCGTGCACGA CGCTGCCCGG GAGGGCTTCC 840 

TGGACACGCT GGTGGTGCTG CACCGGGCCG GGGCGCGGCT GGACGTGCGC GATGCCTGGG 900 

GCCGTCTGCC CGTGGACCTG GCTGAGGAGC TGGGCCATCG CGATGTCGCA CGGTACCTGC 960 

GCGCGGCTGC GGGGGGCACC AGAGGCAGTA ACCATGCCCG CATAGATGCC GCGGAAGGTC 1020

CCTCAGACAT CCCCGATTGA AAGAACCAGA GAGGCTCTGA GAAACCTCGG GAACTTAGAT 1080 

CATCAGTCAC CGAAGGTCCT ACAGGGCCAC AACTGCCCCC GCCACAACCC ACCCCGCTTT 1140 

CGTAGTTTTC ATTTAGAAAA TAGAGCTTTT AAAAATGTCC TGCCTTTTAA CGTAGATATA 1200 

TGCCTTCCCC CACTACCGTA AATGTCCATT TATATCATTT TTTATATATT CTTATAAAAA 1260

TGTAAAAAAG AAAAACACCG CTTCTGCCTT TTCACTGTGT TGGAGTTTTC TGGAGTGAGC 1320

ACTCACGCCC TAAGCGCACA TTCATGTGGG CATTTCTTGC GAGCCTCGCA GCCTCCGGAA 1380 

GCTGTCGACT TCATGACAAG CATTTTGTGA ACTAGGGAAG CTCAGGGGGG TTACTGGCTT 1440 

CTCTTGAGTC ACACTGCTAG CAAATGGCAG AACCAAAGCT CAAATAAAAA TAAAATAATT 1500 
TTCATTCATT CACTC 



Seq ID NO: 131 Protein Sequence 
Protein Accession #: NP_478104.1

1 11 21 31 41 51

| | | | | |

MEPAAGSSME PAAGSSMEPS ADWLATAAAR GRVEEVRALL EAGALPNAPN SYGRRPIQVG 60 
RRSAAGAGDG GRLWRTKFAG ELESGSASIL RKKGRLPGEF SEGVCNHRPP PGDALGAWET 120 
KEEE 

Seq ID NO: 132 DNA sequence

Nucleic Acid Accession #: NM_058195.1
Coding sequence: 163..684


1 11 21 31 41 51

| | | | | |

CCTCCCTACG GGCGCCTCCG GCAGCCCTTC CCGCGTGCGC AGGGCTCAGA GCCGTTCCGA 60
GATCTTGGAG GTCCGGGTGG GAGTGGGGGT GGGGTGGGGG TGGGGGTGAA GGTGGGGGGC 120
GGGCGCGCTC AGGGAAGGCG GGTGCGCGCC TGCGGGGCGG AGATGGGCAG GGGGCGGTGC 180
GTGGGTCCCA GTCTGCAGTT AAGGGGGCAG GAGTGGCGCT GCTCACCTCT GGTGCCAAAG 240
GGCGGCGCAG CGGCTGCCGA GCTCGGCCCT GGAGGCGGCG AGAACATGGT GCGCAGGTTC 300
TTGGTGACCC TCCGGATTCG GCGCGCGTGC GGCCCGCCGC GAGTGAGGGT TTTCGTGGTT 360
CACATCCCGC GGCTCACGGG GGAGTGGGCA GCGCCAGGGG CGCCCGCCGC TGTGGCCCTC 420
GTGCTGATGC TACTGAGGAG CCAGCGTCTA GGGCAGCAGC CGCTTCCTAG AAGACCAGGT 480
CATGATGATG GGCAGCGCCC GAGTGGCGGA GCTGCTGCTG CTCCACGGCG CGGAGCCCAA 540
CTGCGCCGAC CCCGCCACTC TCACCCGACC CGTGCACGAC GCTGCCCGGG AGGGCTTCCT 600
GGACACGCTG GTGGTGCTGC ACCGGGCCGG GGCGCGGCTG GACGTGCGCG ATGCCTGGGG 660
CCGTCTGCCC GTGGACCTGG CTGAGGAGCT GGGCCATCGC GATGTCGCAC GGTACCTGCG 720
CGCGGCTGCG GGGGGCACCA GAGGCAGTAA CCATGCCCGC ATAGATGCCG CGGAAGGTCC 780
CTCAGACATC CCCGATTGAA AGAACCAGAG AGGCTCTGAG AAACCTCGGG AAACTTAGAT 840
CATCAGTCAC CGAAGGTCCT ACAGGGCCAC AACTGCCCCC GCCACAACCC ACCCCGCTTT 900
CGTAGTTTTC ATTTAGAAAA TAGAGCTTTT AAAAATGTCC TGCCTTTTAA CGTAGATATA 960
TGCCTTCCCC CACTACCGTA AATGTCCATT TATATCATTT TTTATATATT CTTATAAAAA 1020
TGTAAAAAAG AAAAACACCG CTTCTGCCTT TTCACTGTGT TGGAGTTTTC TGGAGTGAGC 1080
ACTCACGCCC TAAGCGCACA TTCATGTGGG CATTTCTTGC GAGCCTCGCA GCCTCCGGAA 1140
GCTGTCGACT TCATGACAAG CATTTTGTGA ACTAGGGAAG CTCAGGGGGG TTACTGGCTT 1200
CTCTTGAGTC ACACTGCTAG CAAATGGCAG AACCAAAGCT CAAATAAAAA TAAAATAATT 1260
TTCATTCATT CACTC



Seq ID NO: 133 Protein Sequence
Protein Accession #: NP_478102.1

1 11 21 31 41 51

| | | | | |

MGRGRCVGPS LQLRGQEWRC SPLVPKGGAA AAELGPGGGE NMVRRFLVTL RIRRACGPPR 60
VRVFVVHIPR LTGEWAAPGA PAAVALVLML LRSQRLGQQP LPRRPGHDDG QRPSGGAAAA 120
PRRGAQLRRP RHSHPTRARR CPGGLPGHAG GAAPGRGAAG RARCLGPSAR GPG

Seq ID NO: 134 DNA sequence 
Nucleic Acid Accession #: NM_023915
Coding sequence: 1..1077

1 11 21 31 41 51

| | | | | |

ATGGGGTTCA ACTTGACGCT TGCAAAATTA CCAAATAACG AGCTGCACGG CCAAGAGAGT 60
CACAATTCAG GCAACAGGAG CGACGGGCCA GGAAAGAACA CCACCCTTCA CAATGAATTT 120
GACACAATTG TCTTGCCGGT GCTTTATCTC ATTATATTTG TGGCAAGCAT CTTGCTGAAT 180
GGTTTAGCAG TGTGGATCTT CTTCCACATT AGGAATAAAA CCAGCTTCAT ATTCTATCTC 240
AAAAACATAG TGGTTGCAGA CCTCATAATG ACGCTGACAT TTCCATTTCG AATAGTCCAT 300
GATGCAGGAT TTGGACCTTG GTACTTCAAG TTTATTCTCT GCAGATACAC TTCAGTTTTG 360
TTTTATGCAA ACATGTATAC TTCCATCGTG TTCCTTGGGC TGATAAGCAT TGATCGCTAT 420
CTGAAGGTGG TCAAGCCATT TGGGGACTCT CGGATGTACA GCATAACCTT CACGAAGGTT 480
TTATCTGTTT GTGTTTGGGT GATCATGGCT GTTTTGTCTT TGCCAAACAT CATCCTAACA 540
AATGGTCAGC CAACAGAGGA CAATATCCAT GACTGCTCAA AACTTAAAAG TCCTTTGGGG 600
GTCAAATGGC ATACGGCAGT CACCTATGTG AACAGCTGCT TGTTTGTGGC CGTGCTGGTG 660
ATTCTGATCG GATGTTACAT AGCCATATCC AGGTACATCC ACAAATCCAG CAGGCAATTC 720
ATAAGTCAGT CAAGCCGAAA GCGAAAACAT AACCAGAGCA TCAGGGTTGT TGTGGCTGTG 780
TTTTTTACCT GCTTTCTACC ATATCACTTG TGCAGAATTC CTTTTACTTT TAGTCACTTA 840
GACAGGCTTT TAGATGAATC TGCACAAAAA ATCCTATATT ACTGCAAAGA AATTACACTT 900
TTCTTGTCTG CGTGTAATGT TTGCCTGGAT CCAATAATTT ACTTTTTCAT GTGTAGGTCA 960
TTTTCAAGAA GGCTGTTCAA AAAATCAAAT ATCAGAACCA GGAGTGAAAG CATCAGATCA 1020
CTGCAAAGTG TGAGAAGATC GGAAGTTCGC ATATATTATG ATTACACTGA TGTGTAA



Seq ID NO: 135 Protein Sequence
Protein Accession #: NP_076404

1 11 21 31 41 51

| | | | | |

MGFNLTLAKL PNNELHGQES HNSGNRSDGP GKNTTLHNEF DTIVLPVLYL IIFVASILLN 60
GLAVWIFFHI RNKTSFIFYL KNIVVADLIM TLTFPFRIVH DAGFGPWYFK FILCRYTSVL 120
FYANMYTSIV FLGLISIDRY LKVVKPFGDS RMYSITFTKV LSVCVWVIMA VLSLPNIILT 180
NGQPTEDNIH DCSKLKSPLG VKWHTAVTYV NSCLFVAVLV ILIGCYIAIS RYIHKSSRQF 240
ISQSSRKRKH NQSIRVVVAV FFTCFLPYHL CRIPFTFSHL DRLLDESAQK ILYYCKEITL 300
FLSACNVCLD PIIYFFMCRS FSRRLFKKSN IRTRSESIRS LQSVRRSEVR IYYDYTDV



Seq ID NO: 136 DNA sequence 

Nucleic Acid Accession #: NM_005329.1

Coding sequence: 1..1662 



1 11 21 31 41 51

| | | | | |

ATGCCGGTGC AGCTGACGAC AGCCCTGCGT GTGGTGGGCA CCAGCCTGTT TGCCCTGGCA 60

GTGCTGGGTG GCATCCTGGC AGCCTATGTG ACGGGCTACC AGTTCATCCA CACGGAAAAG 120 

CACTACCTGT CCTTCGGCCT GTACGGCGCC ATCCTGGGCC TGCACCTGCT CATTCAGAGC 180 

CTTTTTGCCT TCCTGGAGCA CCGGCGCATG CGACGTGCCG GCCAGGCCCT GAAGCTGCCC 240 

TCCCCGCGGC GGGGCTCGGT GGCACTGTGC ATTGCCGCGT ACCAGGAGGA CCCTGACTAC 300

TTGCGCAAGT GCCTGCGCTC GGCCCAGCGC ATCTCCTTCC CTGACCTCAA GGTGGTCATG 360 

GTGGTGGATG GCAACCGCCA GGAGGACGCC TACATGCTGG ACATCTTCCA CGAGGTGCTG 420 

GGCGGCACCG AGCAGGCCGG CTTCTTTGTG TGGCGCAGCA ACTTCCATGA GGCAGGCGAG 480 

GGTGAGACGG AGGCCAGCCT GCAGGAGGGC ATGGACCGTG TGCGGGATGT GGTGCGGGCC 540

AGCACCTTCT CGTGCATCAT GCAGAAGTGG GGAGGCAAGC GCGAGGTCAT GTACACGGCC 600

TTCAAGGCCC TCGGCGATTC GGTGGACTAC ATCCAGGTGT GCGACTCTGA CACTGTGCTG 660 

GATCCAGCCT GCACCATCGA GATGCTTCGA GTCCTGGAGG AGGATCCCCA AGTAGGGGGA 720 

GTCGGGGGAG ATGTCCAGAT CCTCAACAAG TACGACTCAT GGATTTCCTT CCTGAGCAGC 780 

GTGCGGTACT GGATGGCCTT CAACGTGGAG CGGGCCTGCC AGTCCTACTT TGGCTGTGTG 840 

CAGTGTATTA GTGGGCCCTT GGGCATGTAC CGCAACAGCC TCCTCCAGCA GTTCCTGGAG 900

GACTGGTACC ATCAGAAGTT CCTAGGCAGC AAGTGCAGCT TCGGGGATGA CCGGCACCTC 960 

ACCAACCGAG TCCTGAGCCT TGGCTACCGA ACTAAGTATA CCGCGCGCTC CAAGTGCCTC 1020 

ACAGAGACCC CCACTAAGTA CCTCCGGTGG CTCAACCAGC AAACCCGCTG GAGCAAGTCT 1080 

TACTTCCGGG AGTGGCTCTA CAACTCTCTG TGGTTCCATA AGCACCACCT CTGGATGACC 1140 

TACGAGTCAG TGGTCACGGG TTTCTTCCCC TTCTTCCTCA TTGCCACGGT TATACAGCTT 1200

TTCTACCGGG GCCGCATCTG GAACATTCTC CTCTTCCTGC TGACGGTGCA GCTGGTGGGC 1260 

ATTATCAAGG CCACCTACGC CTGCTTCCTT CGGGGCAATG CAGAGATGAT CTTCATGTCC 1320 

CTCTACTCCC TCCTCTATAT GTCCAGCCTT CTGCCGGCCA AGATCTTTGC CATTGCTACC 1380 

ATCAACAAAT CTGGCTGGGG CACCTCTGGC CGAAAAACCA TTGTGGTGAA CTTCATTGGC 1440 

CTCATTCCTG TGTCCATCTG GGTGGCAGTT CTCCTGGGAG GGCTGGCCTA CACAGCTTAT 1500

TGCCAGGACC TGTTCAGTGA GACAGAGCTA GCCTTCCTTG TCTCTGGGGC TATACTGTAT 1560 

GGCTGCTACT GGGTGGCCCT CCTCATGCTA TATCTGGCCA TCATCGCCCG GCGATGTGGG 1620 
AAGAAGCCGG AGCAGTACAG CTTGGCTTTT GCTGAGGTGT GA 

Seq ID NO: 137 Protein Sequence
Protein Accession #: NP_005320.1

1 11 21 31 41 51

| | | | | |

MPVQLTTALR VVGTSLFALA VLGGILAAYV TGYQFIHTEK HYLSFGLYGA ILGLHLLIQS 60
LFAFLEHRRM RRAGQALKLP SPRRGSVALC IAAYQEDPDY LRKCLRSAQR ISFPDLKVVM 120
VVDGNRQEDA YMLDIFHEVL GGTEQAGFFV WRSNFHEAGE GETEASLQEG MDRVRDVVRA 180
STFSCIMQKW GGKREVMYTA FKALGDSVDY IQVCDSDTVL DPACTIEMLR VLEEDPQVGG 240
VGGDVQILNK YDSWISFLSS VRYWMAFNVE RACQSYFGCV QCISGPLGMY RNSLLQQFLE 300
DWYHQKFLGS KCSFGDDRHL TNRVLSLGYR TKYTARSKCL TETPTKYLRW LNQQTRWSKS 360
YFREWLYNSL WFHKHHLWMT YESVVTGFFP FFLIATVIQL FYRGRIWNIL LFLLTVQLVG 420
IIKATYACFL RGNAEMIFMS LYSLLYMSSL LPAKIFAIAT INKSGWGTSG RKTIVVNFIG 480
LIPVSIWVAV LLGGLAYTAY CQDLFSETEL AFLVSGAILY GCYWVALLML YLAIIARRCG 540
KKPEQYSLAF AEV



Seq ID NO: 138 DNA Sequence

Nucleic Acid Accession #: NM_001327.1

Coding sequence: 89..631

1 11 21 31 41 51

| | | | | |

AGCAGGGGGC GCTGTGTGTA CCGAGAATAC GAGAATACCT CGTGGGCCCT GACCTTCTCT 60 

CTGAGAGCCG GGCAGAGGCT CCGGAGCCAT GCAGGCCGAA GGCCGGGGCA CAGGGGGTTC 120 

GACGGGCGAT GCTGATGGCC CAGGAGGCCC TGGCATTCCT GATGGCCCAG GGGGCAATGC 180

TGGCGGCCCA GGAGAGGCGG GTGCCACGGG CGGCAGAGGT CCCCGGGGCG CAGGGGCAGC 240 

AAGGGCCTCG GGGCCGGGAG GAGGCGCCCC GCGGGGTCCG CATGGCGGCG CGGCTTCAGG 300 

GCTGAATGGA TGCTGCAGAT GCGGGGCCAG GGGGCCGGAG AGCCGCCTGC TTGAGTTCTA 360 

CCTCGCCATG CCTTTCGCGA CACCCATGGA AGCAGAGCTG GCCCGCAGGA GCCTGGCCCA 420 

GGATGCCCCA CCGCTTCCCG TGCCAGGGGT GCTTCTGAAG GAGTTCACTG TGTCCGGCAA 480

CATACTGACT ATCCGACTGA CTGCTGCAGA CCACCGCCAA CTGCAGCTCT CCATCAGCTC 540 

CTGTCTCCAG CAGCTTTCCC TGTTGATGTG GATCACGCAG TGCTTTCTGC CCGTGTTTTT 600 

GGCTCAGCCT CCCTCAGGGC AGAGGCGCTA AGCCCAGCCT GGCGCCCCTT CCTAGGTCAT 660 

GCCTCCTCCC CTAGGGAATG GTCCCAGCAC GAGTGGCCAG TTCATTGTGG GGGCCTGATT 720 

GTTTGTCGCT GGAGGAGGAC GGCTTACATG TTTGTTTCTG TAGAAAATAA AACTGAGCTA 780

Seq ID NO: 139 Protein Sequence
Protein Accession #: NP_001318.1

1 11 21 31 41 51

| | | | | |

MQAEGRGTGG STGDADGPGG PGIPDGPGGN AGGPGEAGAT GGRGPRGAGA ARASGPGGGA 60 

PRGPHGGAAS GLNGCCRCGA RGPESRLLEF YLAMPFATPM EAELARRSLA QDAPPLPVPG 120 

VLLKEFTVSG NILTIRLTAA DHRQLQLSIS SCLQQLSLLM WITQCFLPVF LAQPPSGQRR 180 



Seq ID NO: 140 DNA Sequence 

Nucleic Acid Accession #: Eos sequence

Coding sequence: 53..459

1 11 21 31 41 51

| | | | | |

CCTCGTGGGC CCTGACCTTC TCTCTGAGAG CCGGGCAGAG GCTCCGGAGC CATGCAGGCC 60
GAAGGCCAGG GCACAGGGGG TTCGACGGGC GATGCTGATG GCCCAGGAGG CCCTGGCATT 120
CCTGATGGCC CAGGGGGCAA TGCTGGCGGC CCAGGAGAGG CGGGTGCCAC GGGCGGCAGA 180

GCGCAGGGGC AGCAAGGGCC TCGGGGCCGA GAGGAGGCGC CCCGCGGGGT 240 
CCGCATGGCG GTGCCGCTTC TGCGCAGGAT GGAAGGTGCC CCTGCGGGGC CAGGAGGCCG 300 
^^ CC TGCTTCAGTT CCGACTGACT GCTGCAGACC ACCGCCAACT GCAGCTCTCC 360 
-t.^^ TCCT GTCTCCAGCA GCTTTCCCTG TTGATGTGGA TCACGCAGTG CTTTCTGCCC 420 
GTGTTTTTGG CTCAGGCTCC CTCAGGGCAG AGGCGCTAAG CCCAGCCTGG CGCCCCTTCC 480
TAGGTCATGC CTCCTCCCCT AGGGAATGGT CCCAGCACGA GTGGCCAGTT CATTGTGGGG 540
GCCTGATTGT TTGTCGCTGG AGGAGGACGG CTTACATGTT TGTTTCTGTA GAAAATAAAG 600
^- 1 vAGCTA 

Seq ID NO: 141 Protein Sequence
Protein Accession #: Eos sequence

1 11 21 31 41 51

| | | | | |

MQAEGQGTGG STGDADGPGG PGIPDGPGGN AGGPGEAGAT GGRGPRGAGA ARASGPRGGA 60
PRGPHGGAAS AQDGRCPCGA RRPDSRLLQF RLTAADHRQL QLSISSCLQQ LSLLMWITQC 120
FLPVFLAQAP SGQRR

Seq ID NO: 142 DNA Sequence 

Nucleic Acid Accession #: NM_130467.2 

Coding sequence: 246..638

1 11 21 31 41 51

| | | | | |

AGGCAGAGCT CTGCAAGGAG AGGTTGTGTC TTCGTTCTTT CCGCCATCTT CGTTCTTTCC 60 

AACATCTTCG TTCTTTCTCA CTGACCGAGA CTCAGCCGGT AGGTCTGCAG AGTGGTCTTC 120 

CTGGTAATTT AGTTGTGAGT GAATGTGTGG AGGAGCCAGC GGGCTTAGGA CAGGTCCTGT 180 

GGCACAGTCC GTGGCTTTGA GGGAAAAGGG CCTCGCGGTG GTCCTCCGCC TTCCCCCAGG 240 

TCGTGATGCA GGCGCCATGG GCCGGTAATC GTGGCTGGGC TGGAACGAGG GAGGAAGTGA 300 

GAGATATGAG TGAGCATGTA ACAAGATCCC AATCCTCAGA AAGAGGAAAT GACCAAGAGT 360 

CTTCCCAGCC AGTTGGACCT GTGATTGTCC AGCAGCCCAC TGAGGAAAAA CGTCAAGAAG 420

AGGAACCACC AACTGATAAT CAGGGTATTG CACCTAGTGG GGAGATCAAA AATGAAGGAG 480 

CACCTGCTGT TCAAGGGACT GATGTGGAAG CTTTTCAACA GGAACTGGCT CTGCTTAAGA 540

TAGAGGATGC ACCTGGAGAT GGTCCTGATG TCAGGGAGGG GACTCTGCCC ACTTTTGATC 600 

CCACTAAAGT GCTGGAAGCA GGTGAAGGGC AACTATAGGT TTAAACCAAG ACAAATGAAG 660 

ACTGAAACCA AGAATATTGT TCTTATGCTG GAAATTTGAC TGCTAACATT CTCTTAATAA 720 
AGTTTTACAG TTTTC 

Seq ID NO: 143 Protein Sequence 
Protein Accession #: NP_569734 

1 11 21 31 41 51

| | | | | |

MSEHVTRSQS SERGNDQESS QPVGPVIVQQ PTEEKRQEEE PPTDNQGIAP SGEIKNEGAP 60 
AVQGTDVEAF QQELALLKIE DAPGDGPDVR EGTLPTFDPT KVLEAGEGQL 

Seq ID NO: 144 DNA Sequence 

Nucleic Acid Accession #: NM_001476.1

Coding sequence: 82..435

1 11 21 31 41 51

| | | | | |

GCCAGGGAGC TGTGAGGCAG TGCTGTGTGG TTCCTGCCGT CCGGACTCTT TTTCCTCTAC 60 

TGAGATTCAT CTGTGTGAAA TATGAGTTGG CGAGGAAGAT CGACCTATTA TTGGCCTAGA 120 

CCAAGGCGCT ATGTACAGCC TCCTGAAGTG ATTGGGCCTA TGCGGCCCGA GCAGTTCAGT 180 

GATGAAGTGG AACCAGCAAC ACCTGAAGAA GGGGAACCAG CAACTCAACG TCAGGATCCT 240 

GCAGCTGCTC AGGAGGGAGA GGATGAGGGA GCATCTGCAG GTCAAGGGCC GAAGCCTGAA 300 

GCTGATAGCC AGGAACAGGG TCACCCACAG ACTGGGTGTG AGTGTGAAGA TGGTCCTGAT 360 

GGGCAGGAGG TGGACCCGCC AAATCCAGAG GAGGTGAAAA CGCCTGAAGA AGGTGAAAAG 420 



CAATCACAGT GTTAAAAGAA GACACGTTGA AATGATGCAG GCTGCTCCTA TGTTGGAAAT 480
TTGTTCATTA AAATTCTCCC AATAAAGCTT TACAGCCTTC TGCAAAA



Seq ID NO: 145 Protein Sequence
Protein Accession #: NP_001467.1

1 11 21 31 41 51

| | | | | |

MSWRGRSTYY WPRPRRYVQP PEVIGPMRPE QFSDEVEPAT PEEGEPATQR QDPAAAQEGE 60
DEGASAGQGP KPEADSQEQG HPQTGCECED GPDGQEVDPP NPEEVKTPEE GEKQSQC

Seq ID NO: 146 DNA Sequence 
Nucleic Acid Accession #: NM_005562
Coding sequence: 90..3671

1 11 21 31 41 51

| | | | | |

ACAGCGGAGC GCAGAGTGAG AACCACCAAC CGAGGCGCCG GGCAGCGACC CCTGCAGCGG 60

AGACAGAGAC TGAGCGGCCC GGCACCGCCA TGCCTGCGCT CTGGCTGGGC TGCTGCCTCT 120

GCTTCTCGCT CCTCCTGCCC GCAGCCCGGG CCACCTCCAG GAGGGAAGTC TGTGATTGCA 180 

ATGGGAAGTC CAGGCAGTGT ATCTTTGATC GGGAACTTCA CAGACAAACT GGTAATGGAT 240 

TCCGCTGCCT CAACTGCAAT GACAACACTG ATGGCATTCA CTGCGAGAAG TGCAAGAATG 300 

GCTTTTACCG GCACAGAGAA AGGGACCGCT GTTTGCCCTG CAATTGTAAC TCCAAAGGTT 360 

CTCTTAGTGC TCGATGTGAC AACTCTGGAC GGTGCAGCTG TAAACCAGGT GTGACAGGAG 420

CCAGATGCGA CCGATGTCTG CCAGGCTTCC ACATGCTCAC GGATGCGGGG TGCACCCAAG 480 

ACCAGAGACT GCTAGACTCC AAGTGTGACT GTGACCCAGC TGGCATCGCA GGGCCCTGTG 540 

ACGCGGGCCG CTGTGTCTGC AAGCCAGCTG TTACTGGAGA ACGCTGTGAT AGGTGTCGAT 600 

CAGGTTACTA TAATCTGGAT GGGGGGAACC CTGAGGGCTG TACCCAGTGT TTCTGCTATG 660 

GGCATTCAGC CAGCTGCCGC AGCTCTGCAG AATACAGTGT CCATAAGATC ACCTCTACCT 720

TTCATCAAGA TGTTGATGGC TGGAAGGCTG TCCAACGAAA TGGGTCTCCT GCAAAGCTCC 780 

AATGGTCACA GCGCCATCAA GATGTGTTTA GCTCAGCCCA ACGACTAGAC CCTGTCTATT 840 

TTGTGGCTCC TGCCAAATTT CTTGGGAATC AACAGGTGAG CTATGGGCAA AGCCTGTCCT 900 

TTGACTACCG TGTGGACAGA GGAGGCAGAC ACCCATCTGC CCATGATGTG ATTCTGGAAG 960

GTGCTGGTCT ACGGATCACA GCTCCCTTGA TGCCACTTGG CAAGACACTG CCTTGTGGGC 1020

TCACCAAGAC TTACACATTC AGGTTAAATG AGCATCCAAG CAATAATTGG AGCCCCCAGC 1080 

TGAGTTACTT TGAGTATCGA AGGTTACTGC GGAATCTCAC AGCCCTCCGC ATCCGAGCTA 1140 

CATATGGAGA ATACAGTACT GGGTACATTG ACAATGTGAC CCTGATTTCA GCCCGCCCTG 1200 

TCTCTGGAGC CCCAGCACCC TGGGTTGAAC AGTGTATATG TCCTGTTGGG TACAAGGGGC 1260 

AATTCTGCCA GGATTGTGCT TCTGGCTACA AGAGAGATTC AGCGAGACTG GGGCCTTTTG 1320

GCACCTGTAT TCCTTGTAAC TGTCAAGGGG GAGGGGCCTG TGATCCAGAC ACAGGAGATT 1380 

GTTATTCAGG GGATGAGAAT CCTGACATTG AGTGTGCTGA CTGCCCAATT GGTTTCTACA 1440 

ACGATCCGCA CGACCCCCGC AGCTGCAAGC CATGTCCCTG TCATAACGGG TTCAGCTGCT 1500 

CAGTGATGCC GGAGACGGAG GAGGTGGTGT GCAATAACTG CCCTCCCGGG GTCACCGGTG 1560 

CCCGCTGTGA GCTCTGTGCT GATGGCTACT TTGGGGACCC CTTTGGTGAA CATGGCCCAG 1620

TGAGGCCTTG TCAGCCCTGT CAATGCAACA ACAATGTGGA CCCCAGTGCC TCTGGGAATT 1680 

GTGACCGGCT GACAGGCAGG TGTTTGAAGT GTATCCACAA CACAGCCGGC ATCTACTGCG 1740 

ACCAGTGCAA AGCAGGCTAC TTCGGGGACC CATTGGCTCC CAACCCAGCA GACAAGTGTC 1800 

GAGCTTGCAA CTGTAACCCC ATGGGCTCAG AGCCTGTAGG ATGTCGAAGT GATGGCACCT 1860 

GTGTTTGCAA GCCAGGATTT GGTGGCCCCA ACTGTGAGCA TGGAGCATTC AGCTGTCCAG 1920

CTTGCTATAA TCAAGTGAAG ATTCAGATGG ATCAGTTTAT GCAGCAGCTT CAGAGAATGG 1980 

AGGCCCTGAT TTCAAAGGCT CAGGGTGGTG ATGGAGTAGT ACCTGATACA GAGCTGGAAG 2040 

GCAGGATGCA GCAGGCTGAG CAGGCCCTTC AGGACATTCT GAGAGATGCC CAGATTTCAG 2100 

AAGGTGCTAG CAGATCCCTT GGTCTCCAGT TGGCCAAGGT GAGGAGCCAA GAGAACAGCT 2160 

ACCAGAGCCG CCTGGATGAC CTCAAGATGA CTGTGGAAAG AGTTCGGGCT CTGGGAAGTC 2220

AGTACCAGAA CCGAGTTCGG GATACTCACA GGCTCATCAC TCAGATGCAG CTGAGCCTGG 2280

CAGAAAGTGA AGCTTCCTTG GGAAACACTA ACATTCCTGC CTCAGACCAC TACGTGGGGC 2340

CAAATGGCTT TAAAAGTCTG GCTCAGGAGG CCACAAGATT AGCAGAAAGC CACGTTGAGT 2400

CAGCCAGTAA CATGGAGCAA CTGACAAGGG AAACTGAGGA CTATTCCAAA CAAGCCCTCT 2460

CACTGGTGCG CAAGGCCCTG CATGAAGGAG TCGGAAGCGG AAGCGGTAGC CCGGACGGTG 2520

CTGTGGTGCA AGGGCTTGTG GAAAAATTGG AGAAAACCAA GTCCCTGGCC CAGCAGTTGA 2580 

CAAGGGAGGC CACTCAAGCG GAAATTGAAG CAGATAGGTC TTATCAGCAC AGTCTCCGCC 2640 

TCCTGGATTC AGTGTCTCGG CTTCAGGGAG TCAGTGATCA GTCCTTTCAG GTGGAAGAAG 2700 

CAAAGAGGAT CAAACAAAAA GCGGATTCAC TCTCAACGCT GGTAACCAGG CATATGGATG 2760 

AGTTCAAGCG TACACAAAAG AATCTGGGAA ACTGGAAAGA AGAAGCACAG CAGCTCTTAC 2820

AGAATGGAAA AAGTGGGAGA GAGAAATCAG ATCAGCTGCT TTCCCGTGCC AATCTTGCTA 2880 

AAAGCAGAGC ACAAGAAGCA CTGAGTATGG GCAATGCCAC TTTTTATGAA GTTGAGAGCA 2940 

TCCTTAAAAA CCTCAGAGAG TTTGACCTGC AGGTGGACAA CAGAAAAGCA GAAGCTGAAG 3000 

AAGCCATGAA GAGACTCTCC TACATCAGCC AGAAGGTTTC AGATGCCAGT GACAAGACCC 3060

AGCAAGCAGA AAGAGCCCTG GGGAGCGCTG CTGCTGATGC ACAGAGGGCA AAGAATGGGG 3120

CCGGGGAGGC CCTGGAAATC TCCAGTGAGA TTGAACAGGA GATTGGGAGT CTGAACTTGG 3180 

AAGCCAATGT GACAGCAGAT GGAGCCTTGG CCATGGAAAA GGGACTGGCC TCTCTGAAGA 3240 

GTGAGATGAG GGAAGTGGAA GGAGAGCTGG AAAGGAAGGA GCTGGAGTTT GACACGAATA 3300 

TGGATGCAGT ACAGATGGTG ATTACAGAAG CCCAGAAGGT TGATACCAGA GCCAAGAACG 3360

CTGGGGTTAC AATCCAAGAC ACACTCAACA CATTAGACGG CCTCCTGCAT CTGATGGACC 3420

AGCCTCTCAG TGTAGATGAA GAGGGGCTGG TCTTACTGGA GCAGAAGCTT TCCCGAGCCA 3480 

AGACCCAGAT CAACAGCCAA CTGCGGCCCA TGATGTCAGA GCTGGAAGAG AGGGCACGTC 3540 

AGCAGAGGGG CCACCTCCAT TTGCTGGAGA CAAGCATAGA TGGGATTCTG GCTGATGTGA 3600 

AGAACTTGGA GAACATTAGG GACAACCTGC CCCCAGGCTG CTACAATACC CAGGCTCTTG 3660

AGCAACAGTG AAGCTGCCAT AAATATTTCT CAACTGAGGT TCTTGGGATA CAGATCTCAG 3720

GGCTCGGGAG CCATGTCATG TGAGTGGGTG GGATGGGGAC ATTTGAACAT GTTTAATGGG 3780 

TATGCTCAGG TCAACTGACC TGACCCCATT CCTGATCCCA TGGCCAGGTG GTTGTCTTAT 3840 

TGCACCATAC TCCTTGCTTC CTGATGCTGG GCAATGAGGC AGATAGCACT GGGTGTGAGA 3900 

ATGATCAAGG ATCTGGACCC CAAAGAATAG ACTGGATGGA AAGACAAACT GCACAGGCAG 3960

ATGTTTGCCT CATAATAGTC GTAAGTGGAG TCCTGGAATT TGGACAAGTG CTGTTGGGAT 4020

ATAGTCAACT TATTCTTTGA GTAATGTGAC TAAAGGAAAA AACTTTGACT TTGCCCAGGC 4080

ATGAAATTCT TCCTAATGTC AGAACAGAGT GCAACCCAGT CACACTGTGG CCAGTAAAAT 4140 

ACTATTGCCT CATATTGTCC TCTGCAAGCT TCTTGCTGAT CAGAGTTCCT CCTACTTACA 4200 

ACCCAGGGTG TGAACATGTT CTCCATTTTC AAGCTGGAAG AAGTGAGCAG TGTTGGAGTG 4260
AGGACCTGTA AGGCAGGCCC ATTCAGAGCT ATGGTGCTTG CTGGTGCCTG CCACCTTCAA 4320
GTTCTGGACC TGGGCATGAC ATCCTTTCTT TTAATGATGC CATGGCAACT TAGAGATTGC 4380
ATTTTTATTA AAGCATTTCC TACCAGCAAA GCAAATGTTG GGAAAGTATT TACTTTTTCG 4440
GTTTCAAAGT GATAGAAAAG TGTGGCTTGG GCATTGAAAG AGGTAAAATT CTCTAGATTT 4500
ATTAGTCCTA ATTCAATCCT ACTTTTCGAA CACCAAAAAT GATGCGCATC AATGTATTTT 4560
ATCTTATTTT CTCAATCTCC TCTCTCTTTC CTCCACCCAT AATAAGAGAA TGTTCCTACT 4620
CACACTTCAG CTGGGTCACA TCCATCCCTC CATTCATCCT TCCATCCATC TTTCCATCCA 4680
TTACCTCCAT CCATCCTTCC AACATATATT TATTGAGTAC CTACTGTGTG CCAGGGGCTG 4740
GTGGGACAGT GGTGACATAG TCTCTGCCCT CATAGAGTTG ATTGTCTAGT GAGGAAGACA 4800
AGCATTTTTA AAAAATAAAT TTAAACTTAC AAACTTTGTT TGTCACAAGT GGTGTTTATT 4860
GCAATAACCG CTTGGTTTGC AACCTCTTTG CTCAACAGAA CATATGTTGC AAGACCCTCC 4920
CATGGGGGCA CTTGAGTTTT GGCAAGGCTG ACAGAGCTCT GGGTTGTGCA CATTTCTTTG 4980
CATTCCAGCT GTCACTCTGT GCCTTTCTAC AACTGATTGC AACAGACTGT TGAGTTATGA 5040
TAACACCAGT GGGAATTGCT GGAGGAACCA GAGGCACTTC CACCTTGGCT GGGAAGACTA 5100
TGGTGCTGCC TTGCTTCTGT ATTTCCTTGG ATTTTCCTGA AAGTGTTTTT AAATAAAGAA 5160
CAATTGTTAG ATGCC



Seq ID NO: 147 Protein Sequence
Protein Accession #: NP_005553

1 11 21 31 41 51

| | | | | |

MPALWLGCCL CFSLLLPAAR ATSRREVCDC NGKSRQCIFD RELHRQTGNG FRCLNCNDNT 60
DGIHCEKCKN GFYRHRERDR CLPCNCNSKG SLSARCDNSG RCSCKPGVTG ARCDRCLPGF 120
HMLTDAGCTQ DQRLLDSKCD CDPAGIAGPC DAGRCVCKPA VTGERCDRCR SGYYNLDGGN 180
PEGCTQCFCY GHSASCRSSA EYSVHKITST FHQDVDGWKA VQRNGSPAKL QWSQRHQDVF 240
SSAQRLDPVY FVAPAKFLGN QQVSYGQSLS FDYRVDRGGR HPSAHDVILE GAGLRITAPL 300
MPLGKTLPCG LTKTYTFRLN EHPSNNWSPQ LSYFEYRRLL RNLTALRIRA TYGEYSTGYI 360
DNVTLISARP VSGAPAPWVE QCICPVGYKG QFCQDCASGY KRDSARLGPF GTCIPCNCQG 420
GGACDPDTGD CYSGDENPDI ECADCPIGFY NDPHDPRSCK PCPCHNGFSC SVMPETEEVV 480
CNNCPPGVTG ARCELCADGY FGDPFGEHGP VRPCQPCQCN NNVDPSASGN CDRLTGRCLK 540
CIHNTAGIYC DQCKAGYFGD PLAPNPADKC RACNCNPMGS EPVGCRSDGT CVCKPGFGGP 600
NCEHGAFSCP ACYNQVKIQM DQFMQQLQRM EALISKAQGG DGVVPDTELE GRMQQAEQAL 660
QDILRDAQIS EGASRSLGLQ LAKVRSQENS YQSRLDDLKM TVERVRALGS QYQNRVRDTH 720
RLITQMQLSL AESEASLGNT NIPASDHYVG PNGFKSLAQE ATRLAESHVE SASNMEQLTR 780
ETEDYSKQAL SLVRKALHEG VGSGSGSPDG AVVQGLVEKL EKTKSLAQQL TREATQAEIE 840
ADRSYQHSLR LLDSVSRLQG VSDQSFQVEE AKRIKQKADS LSTLVTRHMD EFKRTQKNLG 900
NWKEEAQQLL QNGKSGREKS DQLLSRANLA KSRAQEALSM GNATFYEVES ILKNLREFDL 960
QVDNRKAEAE EAMKRLSYIS QKVSDASDKT QQAERALGSA AADAQRAKNG AGEALEISSE 1020
IEQEIGSLNL EANVTADGAL AMEKGLASLK SEMREVEGEL ERKELEFDTN MDAVQMVITE 1080
AQKVDTRAKN AGVTIQDTLN TLDGLLHLMD QPLSVDEEGL VLLEQKLSRA KTQINSQLRP 1140
MMSELEERAR QQRGHLHLLE TSIDGILADV KNLENIRDNL PPGCYNTQAL EQQ



Seq ID NO: 148 DNA Sequence 
Nucleic Acid Accession #: NM_021048. 
Coding sequence: 327.. 1436 ~ 



1 11 21 31 41 51
| | | | | |
GGCACGAGGG AGAAGCGAGG TTCTCGTTCT GAGGGACAGG CTTGAGATCG GCTGAAGAGA 60
GCGGGCCCAG GCTCTGTGAG GAGGCAAGGG AGGTGAGAAC CTTGCTCTCA GAGGGTGACT 120
CAAGTCAACA CAGGGAACCC CTCTTTTCTA CAGACACAGT GGGTCGCAGG ATCTGACAAG 180
AGTCCAGGTT CTCAGGGGAC AGGGAGAGCA AGAGGTCAAG AGCTGTGGGA CACCACAGAG 240
CAGCACTGAA GGAGAAGACC TGCCTGTGGG TCCCCATCGC CCAAGTCCTG CCCACACTCC 300
CACCTGCTAC CCTGATCAGA GTCATCATGC CTCGAGCTCC AAAGCGTCAG CGCTGCATGC 360
CTGAAGAAGA TCTTCAATCC CAAAGTGAGA CACAGGGCCT CGAGGGTGCA CAGGCTCCCC 420
TGGCTGTGGA GGAGGATGCT TCATCATCCA CTTCCACCAG CTCCTCTTTT CCATCCTCTT 480
TTCCCTCCTC CTCCTCTTCC TCCTCCTCCT CCTGCTATCC TCTAATACCA AGCACCCCAG 540
AGGAGGTTTC TGCTGATGAT GAGACACCAA ATCCTCCCCA GAGTGCTCAG ATAGCCTGCT 600
CCTCCCCCTC GGTCGTTGCT TCCCTTCCAT TAGATCAATC TGATGAGGGC TCCAGCAGCC 660
AAAAGGAGGA GAGTCCAAGC ACCCTACAGG TCCTGCCAGA CAGTGAGTCT TTACCCAGAA 720
GTGAGATAGA TGAAAAGGTG ACTGATTTGG TGCAGTTTCT GCTCTTCAAG TATCAAATGA 780
AGGAGCCGAT CACAAAGGCA GAAATACTGG AGAGTGTCAT AAAAAATTAT GAAGACCACT 840
TCCCTTTGTT GTTTAGTGAA GCCTCCGAGT GCATGCTGCT GGTCTTTGGC ATTGATGTAA 900
AGGAAGTGGA TCCCACTGGC CACTCCTTTG TCCTTGTCAC CTCCCTGGGC CTCACCTATG 960
ATGGGATGCT GAGTGATGTC CAGAGCATGC CCAAGACTGG CATTCTCATA CTTATCCTAA 1020
GCATAATCTT CATAGAGGGC TACTGCACCC CTGAGGAGGT CATCTGGGAA GCACTGAATA 1080
TGATGGGGCT GTATGATGGG ATGGAGCACC TCATTTATGG GGAGCCCAGG AAGCTGCTCA 1140
CCCAAGATTG GGTGCAGGAA AACTACCTGG AGTACCGGCA GGTGCCTGGC AGTGATCCTG 1200
CACGGTATGA GTTTCTGTGG GGTCCAAGGG CTCATGCTGA AATTAGGAAG ATGAGTCTCC 1260
TGAAATTTTT GGCCAAGGTA AATGGGAGTG ATCCAAGATC CTTCCCACTG TGGTATGAGG 1320
AGGCTTTGAA AGATGAGGAA GAGAGAGCCC AGGACAGAAT TGCCACCACA GATGATACTA 1380
CTGCCATGGC CAGTGCAAGT TCTAGCGCTA CAGGTAGCTT CTCCTACCCT GAATAAAGTA 1440
AGACAGATTC TTCACTGTGT TTTAAAAGGC AAGTCAAATA CCACATGATT TTACTCATAT 1500
GTGGAATCTA AAAAAAAAAA AAAAAAAA






Seq ID NO: 149 Protein Sequence 
Protein Accession #: NP_066386.1 

1 11 21 31 41 51 

| | | | | |

MPRAPKRQRC MPEEDLQSQS ETQGLEGAQA PLAVEEDASS STSTSSSFPS SPPSSSSSSS 60

SSCYPLIPST PEEVSADDET PNPPQSAQIA CSSPSVVASL PLDQSDEGSS SQKEESPSTL 120

QVLPDSESLP RSEIDEKVTD LVQFLLFKYQ MKEPITKAEI LESVIKNYED HFPLLFSEAS 180

ECMLLVFGID VKEVDPTGHS FVLVTSLGLT YDGMLSDVQS MPKTGILILI LSIIFIEGYC 240

TPEEVIWEAL NMMGLYDGME HLIYGEPRKL LTQDWVQENY LEYRQVPGSD PARYEFLWGP 300

RAHAEIRKMS LLKFLAKVNG SDPRSFPLWY EEALKDEEER AQDRIATTDD TTAMASASSS 360 
ATGSFSYPE 

Seq ID NO: 150 DNA Sequence
Nucleic Acid Accession #: NM_003695
Coding sequence: 12..398

1 11 21 31 41 51

| | | | | |

CGACATCAGA GATGAGGACA GCATTGCTGC TCCTTGCAGC CCTGGCTGTG GCTACAGGGC 60 

CAGCCCTTAC CCTGCGCTGC CACGTGTGCA CCAGCTCCAG CAACTGCAAG CATTCTGTGG 120 

TCTGCCCGGC CAGCTCTCGC TTCTGCAAGA CCACGAACAC AGTGGAGCCT CTGAGGGGGA 180 

ATCTGGTGAA GAAGGACTGT GCGGAGTCGT GCACACCCAG CTACACCCTG CAAGGCCAGG 240 

TCAGCAGCGG CACCAGCTCC ACCCAGTGCT GCCAGGAGGA CCTGTGCAAT GAGAAGCTGC 300 

ACAACGCTGC ACCCACCCGC ACCGCCCTCG CCCACAGTGC CCTCAGCCTG GGGCTGGCCC 360

TGAGCCTCCT GGCCGTCATC TTAGCCCCCA GCCTGTGACC TTCCCCCCAG GGAAGGCCCC 420 

TCATGCCTTT CCTTCCCTTT CTCTGGGGAT TCCACACCTC TCTTCCCCAG CCGGCAACGG 480 

GGGTGCCAGG AGCCCCAGGC TGAGGGCTTC CCCGAAAGTC TGGGACCAGG TCCAGGTGGG 540 

CATGGAATGC TGATGACTTG GAGCAGGCCC CACAGACCCC ACAGAGGATG AAGCCACCCC 600 

ACAGAGGATG CAGCCCCCAG CTGCATGGAA GGTGGAGGAC AGAAGCCCTG TGGATCCCCG 660

GATTTCACAC TCCTTCTGTT TTGTTGCCGT TTATTTTGTA CTCAAATCTC TACATGGAGA 720 
TAAATGATTT AAACC 

Seq ID NO: 151 Protein Sequence 
Protein Accession #: NP_003686

1 11 21 31 41 51

| | | | | |

MRTALLLLAA LAVATGPALT LRCHVCTSSS NCKHSVVCPA SSRFCKTTNT VEPLRGNLVK 60

KDCAESCTPS YTLQGQVSSG TSSTQCCQED LCNEKLHNAA PTRTALAHSA LSLGLALSLL 120
AVILAPSL 

Seq ID NO: 152 DNA Sequence
Nucleic Acid Accession #: NM_006398.1
Coding sequence: 19..516

1 11 21 31 41 51

| | | | | |

GGCCCCTTGT CTGCAGAGAT GGCTCCCAAT GCTTCCTGCC TCTGTGTGCA TGTCCGTTCC 60 

GAGGAATGGG ATTTAATGAC CTTTGATGCC AACCCATATG ACAGCGTGAA AAAAATCAAA 120

GAACATGTCC GGTCTAAGAC CAAGGTTCCT GTGCAGGACC AGGTTCTTTT GCTGGGCTCC 180 

AAGATCTTAA AGCCACGGAG AAGCCTCTCA TCTTATGGCA TTGACAAAGA GAAGACCATC 240 

CACCTTACCC TGAAAGTGGT GAAGCCCAGT GATGAGGAGC TGCCCTTGTT TCTTGTGGAG 300 

TCAGGTGATG AGGCAAAGAG GCACCTCCTC CAGGTGCGAA GGTCCAGCTC AGTGGCACAA 360 

GTGAAAGCAA TGATCGAGAC TAAGACGGGT ATAATCCCTG AGACCCAGAT TGTGACTTGC 420
AATGGAAAGA GACTGGAAGA TGGGAAGATG ATGGCAGATT ACGGCATCAG AAAGGGCAAC 480 

TTACTCTTCC TGGCATCTTA TTGTATTGGA GGGTGACCAC CCTGGGGATG GGGTGTTGGC 540 

AGGGGTCAAA AAGCTTATTT CTTTTAATCT CTTACTCAAC GAACACATCT TCTGATGATT 600 

TCCCAAAATT AATGAGAATG AGATGAGTAG AGTAAGATTT GGGTGGGATG GGTAGGATGA 660 

AGTATATTGC CCAACTCTAT GTTTCTTTGA TTCTAACACA ATTAATTAAG TGACATGATT 720

TTTACTAATG TATTACTGAG ACTAGTAAAT AAATTTTTAA GGCAAAATAG AGCATTC 






Seq ID NO: 153 Protein Sequence
Protein Accession #: NP_006389.1

1 11 21 31 41 51

| | | | | |

MAPNASCLCV HVRSEEWDLM TFDANPYDSV KKIKEHVRSK TKVPVQDQVL LLGSKILKPR 60
RSLSSYGIDK EKTIHLTLKV VKPSDEELPL FLVESGDEAK RHLLQVRRSS SVAQVKAMIE 120
TKTGIIPETQ IVTCNGKRLE DGKMMADYGI RKGNLLFLAS YCIGG



Seq ID NO: 154 DNA Sequence 

Nucleic Acid Accession #: BC017490.1

Coding sequence: 74..2788

1 11 21 31 41 51 

I I I I I I 

GTGGGTCACG TGAACCACTT TTCGCGCGAA ACCTGGTTGT TGCTGTAGTG GCGGAGAGGA 60 




TCGTGGTACT GCTATGGCGG AATCATCGGA ATCCTTCACC ATGGCATCCA GCCCGGCCCA 120
GCGTCGGCGA GGCAATGATC CTCTCACCTC CAGCCCTGGC CGAAGCTCCC GGCGTACTGA 180
TGCCCTCACC TCCAGCCCTG GCCGTGACCT TCCACCATTT GAGGATGAGT CCGAGGGGCT 240
CCTAGGCACA GAGGGGCCCC TGGAGGAAGA AGAGGATGGA GAGGAGCTCA TTGGAGATGG 300
CATGGAAAGG GACTACCGCG CCATCCCAGA GCTGGACGCC TATGAGGCCG AGGGACTGGC 360
TCTGGATGAT GAGGACGTAG AGGAGCTGAC GGCCAGTCAG AGGGAGGCAG CAGAGCGGGC 420
CATGCGGCAG CGTGACCGGG AGGCTGGCCG GGGCCTGGGC CGCATGCGCC GTGGGCTCCT 480
GTATGACAGC GATGAGGAGG ACGAGGAGCG CCCTGCCCGC AAGCGCCGCC AGGTGGAGCG 540
GGCCACGGAG GACGGCGAGG AGGACGAGGA GATGATCGAG AGCATCGAGA ACCTGGAGGA 600
TCTCAAAGGC CACTCTGTGC GCGAGTGGGT GAGCATGGCG GGCCCCCGGC TGGAGATCCA 660
CCACCGCTTC AAGAACTTCC TGCGCACTCA CGTCGACAGC CACGGCCACA ACGTCTTCAA 720
GGAGCGCATC AGCGACATGT GCAAAGAGAA CCGTGAGAGC CTGGTGGTGA ACTATGAGGA 780
CTTGGCAGCC AGGGAGCACG TGCTGGCCTA CTTCCTGCCT GAGGCACCGG CGGAGCTGCT 840
GCAGATCTTT GATGAGGCTG CCCTGGAGGT GGTACTGGCC ATGTACCCCA AGTACGACCG 900
CATCACCAAC CACATCCATG TCCGCATCTC CCACCTGCCT CTGGTGGAGG AGCTGCGCTC 960
GCTGAGGCAG CTGCATCTGA ACCAGCTGAT CCGCACCAGT GGGGTGGTGA CCAGCTGCAC 1020
TGGCGTCCTG CCCCAGCTCA GCATGGTCAA GTACAACTGC AACAAGTGCA ATTTCGTCCT 1080
GGGTCCTTTC TGCCAGTCCC AGAACCAGGA GGTGAAACCA GGCTCCTGTC CTGAGTGCCA 1140
GTCGGCCGGC CCCTTTGAGG TCAACATGGA GGAGACCATC TATCAGAACT ACCAGCGTAT 1200
CCGAATCCAG GAGAGTCCAG GCAAAGTGGC GGCTGGCCGG CTGCCCCGCT CCAAGGACGC 1260
CATTCTCCTC GCAGATCTGG TGGACAGCTG CAAGCCAGGA GACGAGATAG AGCTGACTGG 1320
CATCTATCAC AACAACTATG ATGGCTCCCT CAACACTGCC AATGGCTTCC CTGTCTTTGC 1380
CACTGTCATC CTAGCCAACC ACGTGGCCAA GAAGGACAAC AAGGTTGCTG TAGGGGAACT 1440
GACCGATGAA GATGTGAAGA TGATCACTAG CCTCTCCAAG GATCAGCAGA TCGGAGAGAA 1500
GATCTTTGCC AGCATTGCTC CTTCCATCTA TGGTCATGAA GACATCAAGA GAGGCCTGGC 1560
TCTGGCCCTG TTOGGAGGGG AGCCCAAAAA CCCAGGTGGC AAGCACAAGG TACGTGGTGA 1620
TATCAACGTG CTCTTGTGCG GAGACCCTGG CACAGCGAAG TCGCAGTTTC TCAAGTATAT 1680
TGAGAAAGTG TCCAGCCGAG CCATCTTCAC CACTGGCCAG GGGGOGTCGG CTGTGGGCCT 1740
CACGGCGTAT GTCCAGCGGC ACCCTGTCAG CAGGGAGTGG ACCTTGGAGG CTGGGGCCCT 1800
GGTTCTGGCT GACCGAGGAG TGTGTCTCAT TGATGAATTT GACAAGATGA ATGACCAGGA 1860
CAGAACCAGC ATCCATGAGG CCATGGAGCA ACAGAGCATC TCCATCTCGA AGGCTGGCAT 1920
CGTCACCTCC CTGCAGGCTC GCTGCACGGT CATTGCTGCC GCCAACCCCA TAGGAGGGCG 1980
CTAOGACCCC TCGCTGACTT TCTCTGAGAA CGTGGACCTC ACAGAGCCCA TCATCTCACG 2040
CTTTGACATC CTGTGTGTGG TGAGGGACAC CGTGGACCCA GTCCAGGACG AGATGCTGGC 2100
CCGCTTCGTG GTGGGCAGCC ACGTCAGACA CCACCCCAGC AACAAGGAGG AGGAGGGGCT 2160
GGCCAATGGC AGCGCTGCTG AGCCCGCCAT GCCCAACACG TATGGCGTGG AGCCCCTGCC 2220
CCAGGAGGTC CTGAAGAAGT ACATCATCTA CGCCAAGGAG AGGGTCCACC CGAAGCTCAA 2280
CCAGATGGAC CAGGACAAGG TGGCCAAGAT GTACAGTGAC CTGAGGAAAG AATCTATGGC 2340
GACAGGCAGC ATCCCCATTA CGGTGCGGCA CATCGAGTCC ATGATCCGCA TGGCGGAGGC 2400
CCACGCGCGC ATCCATCTGC GGGACTATGT GATCGAAGAC GACGTCAACA TGGCCATCCG 2460
CGTGATGCTG GAGAGCTTCA TAGACACACA GAAGTTCAGC GTCATGCGCA GCATGCGCAA 2520
GACTTTTGCC CGCTACCTTT CATTCCGGCG TGACAACAAT GAGCTGTTGC TCTTCATACT 2580
GAAGCAGTTA GTGGCAGAGC AGGTGACATA TCAGCGCAAC CGCTTTGGGG CCCAGCAGGA 2640
CACTATTGAG GTCCCTGAGA AGGACTTGGT GGATAAGGCT CGTCAGATCA ACATCCACAA 2700
CCTCTCTGCA TTTTATGACA GTGAGCTCTT CAGGATGAAC AAGTTCAGCC ACGACCTGAA 2760
AAGGAAAATG ATCCTGCAGC AGTTCTGAGG CCCTATGCCA TCCATAAGGA TTCCTTGGGA 2820
TTCTGGTTTG GGGTGGTCAG TGCCCTCTGT GCTTTATGGA CACAAAACCA GAGCACTTGA 2880
TGAACTCGGG GTACTAGGGT CAGGGCTTAT AGCAGGATGT CTGGCTGCAC CTGGCATGAC 2940
TGTTTGTTTC TCCAAGCCTG CTTTGTGCTT CTCACCTTTG GGTGGGATGC CTTGCCAGTG 3000
TGTCTTACTT GGTTGCTGAA CATCTTGCCA CCTCCGAGTG CTTTGTCTCC ACTCAGTACC 3060
TTGGATCAGA GCTGCTGAGT TCAGGATGCC TGCGTGTGGT TTAGGTGTTA GCCTTCTTAC 3120
ATGGATGTCA GGAGAGCTGC TGCCCTCTTG GCGTGAGTTG CGTATTCAGG CTGCTTTTGC 3180
TGCCTTTGGC CAGAGAGCTG GTTGAAGATG TTTGTAATCG TTTTCAGTCT CCTGCAGGTT 3240
TCTGTGCCCC TGTGGTGGAA GAGGGCACGA CAGTGCCAGC GCAGCGTTCT GGGCTCCTCA 3300
GTCGCAGGGG TGGGATGTGA GTCATGCGGA TTATCCACTC GCCACAGTTA TCAGCTGCCA 3360
TTGCTCCCTG TCTGTTTCCC CACTCTCTTA TTTGTGCATT CGGTTTGGTT TCTGTAGTTT 3420
TAATTTTTAA TAAAGTTGAA TAAAATATAA AAAAAAAAAA AAAAAA

Seq ID NO: 155 Protein Sequence
Protein Accession #: AAH17490.1

1 11 21 31 41 51
| | | | | |
MAESSESFTM ASSPAQRRRG NDPLTSSPGR SSRRTDALTS SPGRDLPPFE DESEGLLGTE 60
GPLEEEEDGE ELIGDGMERD YRAIPELDAY EAEGLALDDE DVEELTASQR EAAERAMRQR 120
DREAGRGLGR MRRGLLYDSD EEDEERPARK RRQVERATED GEEDEEMIES IENLEDLKGH 180
SVREWVSMAG PRLEIHHRFK NFLRTHVDSH GHNVFKERIS DMCKENRESL VVNYEDLAAR 240
EHVLAYFLPE APAELLQIFD EAALEVVLAM YPKYDRITNH IHVRISHLPL VEELRSLRQL 300
HLNQLIRTSG VVTSCTGVLP QLSMVKYNCN KCNFVLGPFC QSQNQEVKPG SCPECQSAGP 360
FEVNMEETIY QNYQRIRIQE SPGKVAAGRL PRSKDAILLA DLVDSCKPGD EIELTGIYHN 420
NYDGSLNTAN GFPVFATVIL ANHVAKKDNK VAVGELTDED VKMITSLSKD QQIGEKIFAS 480
IAPSIYGHED IKRGLALALF GGSPKNPGGK HKVRGDINVL LCGDPGTAKS QFLKYIEKVS 540
SRAIFTTGQG ASAVGLTAYV QRHPVSREWT LEAGALVLAD RGVCLIDEFD KMNDQDRTSI 600
HEAMEQQSIS ISKAGIVTSL QARCTVIAAA NPIGGRYDPS LTFSENVDLT EPIISRFDIL 660
CVVRDTVDPV QDEMLARFVV GSHVRHHPSN KEEEGLANGS AAEPAMPNTY GVEPLPQEVL 720
KKYIIYAKER VHPKLNQMDQ DKVAKMYSDL RKESMATGSI PITVRHIESM IRMAEAHARI 780
HLRDYVIEDD VNMAIRVMLE SFIDTQKFSV MRSMRKTFAR YLSFRRDNNE LLLFILKQLV 840
AEQVTYQRNR FGAQQDTIEV PEKDLVDKAR QINIHNLSAF YDSELFRMNK FSHDLKRKMI 900
LQQF

Seq ID NO: 156 DNA Sequence 
Nucleic Acid Accession #: NM_002497.1
Coding sequence: 135..1472

1 11 21 31 41 51

| | | | | |

GGCACGAGTA GGGGTGGCGG GTCAGTGCTG CTCGGGGGCT TCTCCATCCA GGTCCCTGGA 60

GTTCCTGGTC CCTGGAGCTC CGCACTTGGC GCGCAACCTG CGTGAGGCAG CGCGACTCTG 120

GCGACTGGCC GGCCATGCCT TCCCGGGCTG AGGACTATGA AGTGTTGTAC ACCATTGGCA 180 

CAGGCTCCTA CGGCCGCTGC CAGAAGATCC GGAGGAAGAG TGATGGCAAG ATATTAGTTT 240 

GGAAAGAACT TGACTATGGC TCCATGACAG AAGCTGAGAA ACAGATGCTT GTTTCTGAAG 300 

TGAATTTGCT TCGTGAACTG AAACATCCAA ACATCGTTCG TTACTATGAT CGGATTATTG 360 

ACCGGACCAA TACAACACTG TACATTGTAA TGGAATATTG TGAAGGAGGG GATCTGGCTA 420

GTGTAATTAC AAAGGGAACC AAGGAAAGGC AATACTTAGA TGAAGAGTTT GTTCTTCGAG 480 

TGATGACTCA GTTGACTCTG GCCCTGAAGG AATGCCACAG ACGAAGTGAT GGTGGTCATA 540 

CCGTATTGCA TCGGGATCTT AAACCAGCCA ATGTTTTCCT GGATGGCAAG CAAAACGTCA 600 

AGCTTGGAGA CTTTGGGCTA GCTAGAATAT TAAACCATGA CACGAGTTTT GCAAAAACAT 660 

TTGTTGGCAC ACCTTATTAC ATGTCTCCTG AACAAATGAA TCGCATGTCC TACAATGAGA 720

AATCAGATAT CTGGTCATTG GGCTGCTTGC TGTATGAGTT ATGTGCATTA ATGCCTCCAT 780

TTACAGCTTT TAGCCAGAAA GAACTCGCTG GGAAAATCAG AGAAGGCAAA TTCAGGCGAA 840 

TTCCATACCG TTACTCTGAT GAATTGAATG AAATTATTAC GAGGATGTTA AACTTAAAGG 900 

ATTACCATCG ACCTTCTGTT GAAGAAATTC TTGAGAACCC TTTAATAGCA GATTTGGTTG 960 

CAGACGAGCA AAGAAGAAAT CTTGAGAGAA GAGGGCGACA ATTAGGAGAG CCAGAAAAAT 1020

CGCAGGATTC CAGCCCTGTA TTGAGTGAGC TGAAACTGAA GGAAATTCAG TTACAGGAGC 1080

GAGAGCGAGC TCTCAAAGCA AGAGAAGAAA GATTGGAGCA GAAAGAACAG GAGCTTTGTG 1140

TTCGTGAGAG ACTAGCAGAG GACAAACTGG CTAGAGCAGA AAATCTGTTG AAGAACTACA 1200 

GCTTGCTAAA GGAACGGAAG TTCCTGTCTC TGGCAAGTAA TCCAGAACTT CTTAATCTTC 1260 

CATCCTCAGT AATTAAGAAG AAAGTTCATT TCAGTGGGGA AAGTAAAGAG AACATCATGA 1320

GGAGTGAGAA TTCTGAGAGT CAGCTCACAT CTAAGTCCAA GTGCAAGGAC CTGAAGAAAA 1380 

GGCTTCACGC TGCCCAGCTG CGGGCTCAAG CCCTGTCAGA TATTGAGAAA AATTACCAAC 1440 

TGAAAAGCAG ACAGATCCTG GGCATGCGCT AGCCAGGTAG AGAGACACAG AGCTGTGTAC 1500 

AGGATGTAAT ATTACCAACC TTTAAAGACT GATATTCAAA TGCTGTAGTG TTGAATACTT 1560 

GGCCCCATGA GCCATGCCTT TCTGTATAGT ACACATGATA TTTCGGAATT GGTTTTACTG 1620

TTCTTCAGCA ACTATTGTAC AAAATGTTCA CATTTAATTT TTCTTTCTTC TTTTAAGAAC 1680 

ATATTATAAA AAGAATACTT TCTTGGTTGG GCTTTTAATC CTGTGTGTGA TTACTAGTAG 1740 

GAACATGAGA TGTGACATTC TAAATCTTGG GAGAAAAAAT AATATTAGGA AAAAAATATT 1800 

TATGCAGGAA GAGTAGCACT CACTGAATAG TTTTAAATGA CTGAGTGGTA TGCTTACAAT 1860 

TGTCATGTCT AGATTTAAAT TTTAAGTCTG AGATTTTAAA TGTTTTTGAG CTTAGAAAAC 1920

CCAGTTAGAT GCAATTTGGT CATTAATACC ATGACATCTT GCTTATAAAT ATTCCATTGC 1980 

TCTGTAGTTC AAATCTGTTA GCTTTGTGAA AATTCATCAC TGTGATGTTT GTATTCTTTT 2040 

TTTTTTTCTG TTTAACAGAA TATGAGCTGT CTGTCATTTA CCTACTTCTT TCCCACTAAA 2100 
TAAAAGAATT CTTCAGTTA 

Seq ID NO: 157 Protein Sequence
Protein Accession #: NP_002488.1

1 11 21 31 41 51

| | | | | |

MPSRAEDYEV LYTIGTGSYG RCQKIRRKSD GKILVWKELD YGSMTEAEKQ MLVSEVNLLR 60

ELKHPNIVRY YDRIIDRTNT TLYIVMEYCE GGDLASVITK GTKERQYLDE EFVLRVMTQL 120

TLALKECHRR SDGGHTVLHR DLKPANVFLD GKQNVKLGDF GLARILNHDT SFAKTFVGTP 180

YYMSPEQMNR MSYNEKSDIW SLGCLLYELC ALMPPFTAFS QKELAGKIRE GKFRRIPYRY 240

SDELNEIITR MLNLKDYHRP SVEEILENPL IADLVADEQR RNLERRGRQL GEPEKSQDSS 300

PVLSELKLKE IQLQERERAL KAREERLEQK EQELCVRERL AEDKLARAEN LLKNYSLLKE 360 

RKFLSLASNP ELLNLPSSVI KKKVHFSGES KENIMRSENS ESQLTSKSKC KDLKKRLHAA 420 
QLRAQALSDI EKNYQLKSRQ ILGMR 

Seq ID NO: 158 DNA Sequence

Protein Accession #: Eos sequence

1 11 21 31 41 51

| | | | | |

TGAGTTTGCC CCCTTACCCC CATCCCAGTG AATATTTGCA ATTCCTAAAG ACGTGTTTTG 60

ATTGTCACAC CTGGGTGGGG AACATGCTAC TGGCATCTAA TGCATAGAGG GCAGTAATGC 120 

TGCTAAACAT CTTTCAACGC ACAGGACAGA GCCCCACAAA AGAGAATTAT CTAGCCCCAA 180 

ATGTCCATAA CACTGCTGTT GAGAAAACCT ACCGCAGGAT CTTACTGGGC TTCATAGGTA 240 

AGCTTGCCTT TGTTCTGGCT TCTGTAGATA TATAAAATAA AGACACTGCC CAGTCCCTCC 300 

CTCAACGTCC CGAGCCAGGG CTCAAGGCAA TTCCAATAAC AGTAGAATGA ACACTAAATA 360

TTGATTTCAA AATCTCAGCA ACTAGAAGAA TGACCAACCA TCCTGGTTGG CCTGGGACTG 420 

TCCTAGTTTT AGCATTGAAA GTTTCAGGTT CCAGGAAAGC CCTCAGGCCT GGGCTGCTGG 480 

TCACCCTAGC AGCTGAGGGA CTCTTCAATA CAGAATTAGT CTTTGTGCAC TGGAGATGAA 540 

TATACTTTAA TTTGTAACAT GTGAAAACAT CTATAAACAT CTACTGAAGC CTGTTCTGTC 600 

TGCACCGACA TTTTCATTGA GTACGGATTC TTCCTACCAG ATACAGCTGC TCTACAACTT 660

TCGAGGGCTG GTATAAAACT AGCTTTTACC TATTTTTAAA AATTACATGA ATAGTAAAAA 720 

CTTGGATTAA CCCAGTATTC GGGTATTTTC AATTTCCTTG GGAGCTTAGA GGACGGACAA 780 

ATAAAAAGAT TATTTCAACA TCAAATATAY GCTATTGTTT ACATATGAAG ATAACCACAT 840 




ATATGTATAA ATTCACOGTT ACTTTTTAGC AATACTATAA AATCCAACAG AAAAAAATAG 900 
CATTTACTAT 

Seq ID NO: 159 DNA Sequence

Nucleic Acid Accession #: Eos sequence

Coding sequence: 264..782

1 11 21 31 41 51

| | | | | |

CCCTGCTCCA GTCACACCCG GAAGCTGACT GGTCCACGCA CAGCTGAAGC ATGAGGAAAC 60 

TCATCGCGGG ACTAATTTTC CTTAAAATTT AGACTTGCAC AGTAAGGACT TCAACTGACC 120 

TTCCTCAGAC TGAGAACTGT TTCCAGTATA TACATCAAGT CACTGAGATC TCCAGCACCC 180 

TGCCGGTGGC ACTACTGAGA GACGAGGTGC CAGGGTGGTT CCTGAAAGTG CCTGAGCCCC 240 

AACTTATCAG CAAGGAGCTC ATCATGCTGA CAGAAGTCAT GGAGGTCTGG CATGGCTTAG 300 

TGATCGCGGT GGTGTCCCTC TTCCTGCAGG CCTGCTTCCT CACCGCCATC AACTACCTGC 360 

TCAGCAGGCA CATGGCCCAC AAGAGTGAAC AGATACTGAA AGCGGCCAGT CTCCAGGTTC 420 

CCAGGCCCAG CCCTGGCCAC CATCATCCAC CTGCTGTCAA AGAGATGAAG GAGACTCAGA 480 

CAGAGAGAGA CATCCCAATG TCTGATTCCC TTTACAGGCA TGACAGCGAC ACACCCTCAG 540 

ATAGCTTGGA TAGCTCCTGC AGTTCGCCTC CTGCCTGCCA GGCCACAGAG GATGTGGATT 600 

ACACACAAGT CGTCTTTTCT GACCCTGGAG AACTAAAAAA TGACTCCCCG CTGGACTATG 660 

AGAACATAAA GGAAATCACA GATTATGTCA ATGTCAATCC AGAAAGACAC AAGCCCAGTT 720

TCTGGTATTT TGTCAACCCT GCTCTGTCTG AGCCAGCGGA ATATGATCAA GTGGCCATGT 780 

GAATTCCAAA TATTTTTAAT GGGGTCCAGT TCTCTATGGA TTCTTACATT TAATTTGTAG 840 

GGAAATGCCA TTTTTCCCCC TTAAACAAGG CATGGGGCTC ACAAGTCTAT GGAGACAGGC 900 

CAAAAAGAAT GTGGAGAAGA AAACTGATAA ATACACAGAG GTCCTCAAGA CCCATGGACT 960 

CCTGGTCTGT ACCCAAAAAA GCTGTTCGTT CCTCAAAAAC AAAAACAAGG CTTGGCTGGG 1020 

AAAACAGGCC AATGCCCCGG CAAGAAAGGT TGAGATCAGA TGTTAGGAAG AACTTTCAGG 1080 

TAAAGTATGA GAACTATGGA GTCCATCAGC AGAGATAGTA GTGAAGTCTC TCCCCAGGGA 1140

AAATTTTAAA AAGGTTGAAT CAGCTGTTGT AGAGTTCTAT TTGGCAATCT CATGGTTAAA 1200 

TGACTTCCCT TTGAGCTCTT TAATTATTGG CAATAAACAA CTTCTTTAAA AGTTTTAAAT 1260 
AAAATAGCAA CCACCACCA 



Seq ID NO: 160 Protein Sequence 
Protein Accession #: Eos sequence

1 11 21 31 41 51

| | | | | |

MLTEVMEVWH GLVIAVVSLF LQACFLTAIN YLLSRHMAHK SEQILKAASL QVPRPSPGHH 60
HPPAVKEMKE TQTERDIPMS DSLYRHDSDT PSDSLDSSCS SPPACQATED VDYTQVVFSD 120
PGELKNDSPL DYENIKEITD YVNVNPERHK PSFWYFVNPA LSEPAEYDQV AM

Seq ID NO: 161 DNA Sequence 
Nucleic Acid Accession #: NM_012152 
Coding sequence: 43.. 1104 



1 11 21 31 41 51 

I I I I | | 

CTTCTTTAAA TTTCTTTCTA GGATGTTCAC TTCTTCTCCA CAATGAATGA GTGTCACTAT 60

GACAAGCACA TGGACTTTTT TTATAATAGG AGCAACACTG ATACTGTCGA TGACTGGACA 120 

GGAACAAAGC TTGTGATTGT TTTGTGTGTT GGGACGTTTT TCTGCCTGTT TATTTTTTTT 180 

TCTAATTCTC TGGTCATCGC GGCAGTGATC AAAAACAGAA AATTTCATTT CCCCTTCTAC 240 

TACCTGTTGG CTAATTTAGC TGCTGCCGAT TTCTTCGCTG GAATTGCCTA TGTATTCCTG 300

ATGTTTAACA CAGGCCCAGT TTCAAAAACT TTGACTGTCA ACCGCTGGTT TCTCCGTCAG 360 

GGGCTTCTGG ACAGTAGCTT GACTGCTTCC CTCACCAACT TGCTGGTTAT CGCCGTGGAG 420 

AGGCACATGT CAATCATGAG GATGCGGGTC CATAGCAACC TGACCAAAAA GAGGGTGACA 480 

CTGCTCATTT TGCTTGTCTG GGCCATCGCC ATTTTTATGG GGGCGGTCCC CACACTGGGC 540 

TGGAATTGCC TCTGCAACAT CTCTGCCTGC TCTTCCCTGG CCCCCATTTA CAGCAGGAGT 600 

TACCTTGTTT TCTGGACAGT GTCCAACCTC ATGGCCTTCC TCATCATGGT TGTGGTGTAC 660 

CTGCGGATCT ACGTGTACGT CAAGAGGAAA ACCAACGTCT TGTCTCCGCA TACAAGTGGG 720 

TCCATCAGCC GCCGGAGGAC ACCCATGAAG CTAATGAAGA CGGTGATGAC TGTCTTAGGG 780 

GCGTTTGTGG TATGCTGGAC CCCGGGCCTG GTGGTTCTGC TCCTCGACGG CCTGAACTGC 840 

AGGCAGTGTG GCGTGCAGCA TGTGAAAAGG TGGTTCCTGC TGCTGGCGCT GCTCAACTCC 900 

GTCGTGAACC CCATCATCTA CTCCTACAAG GACGAGGACA TGTATGGCAC CATGAAGAAG 960 

ATGATCTGCT GCTTCTCTCA GGAGAACCCA GAGAGGCGTC CCTCTCGCAT CCCCTCCACA 1020 

GTCCTCAGCA GGAGTGACAC AGGCAGCCAG TACATAGAGG ATAGTATTAG CCAAGGTGCA 1080 

GTCTGCAATA AAAGCACTTC CTAAACTCTG GATGCCTCTC GGCCCACCCA GGTGATGACT 1140 
GTCTTAGG 



Seq ID NO: 162 Protein Sequence 
Protein Accession #: NP_036284



MNECHYDKHM DFFYNRSNTD TVDDWTGTKL VIVLCVGTFF CLFIFFSNSL VIAAVIKNRK 60 

FHFPFYYLLA NLAAADFFAG IAYVFLMFNT GPVSKTLTVN RWFLRQGLLD SSLTASLTNL 120 

LVIAVERHMS IMRMRVHSNL TKKRVTLLIL LVWAIAIFMG AVPTLGWNCL CNISACSSLA 180 

PIYSRSYLVF WTVSNLMAFL IMVVVYLRIY VYVKRKTNVL SPHTSGSISR RRTPMKLMKT 240
VMTVLGAFVV CWTPGLVVLL LDGLNCRQCG VQHVKRWFLL LALLNSVVNP IIYSYKDEDM 300
YGTMKKMICC FSQENPERRP SRIPSTVLSR SDTGSQYIED SISQGAVCNK STS






Seq ID NO: 163 DNA Sequence

Nucleic Acid Accession #: NM_020242
Coding sequence: 72..4240

1 11 21 31 41 51

| | | | | |

CAGTCGCGCG CGGTGCAGTC GGGAGGTGGA GGCACCGGCT GCATTGTTTT CGGGATCGAG 60 

GGGTGAGGGC GCTATGGCAC CCGGCTGCAA AACTGAGTTA CGCAGCGTGA CAAATGGTCA 120 

GTCTAACCAA CCAAGTAATG AAGGTGATGC CATCAAAGTT TTTGTGCGAA TTCGTCCTCC 180

TGCAGAAAGA TCTGGGTCAG CTGATGGAGA GCAGAACTTA TGCTTATCTG TGCTGTCCTC 240 

CACGAGTCTC CGGCTGCACT CCAACCCTGA GCCCAAGACC TTCACGTTTG ATCATGTTGC 300

AGATGTGGAT ACCACTCAGG AATCTGTATT TGCAACTGTG GCTAAAAGCA TTGTGGAGTC 360 

TTGCATGAGC GGTTATAATG GTACCATCTT TGCATATGGA CAGACTGGCT CAGGGAAGAC 420 

ATTTACTATG ATGGGACCAT CTGAATCTGA TAATTTTTCT CATAACCTGA GAGGAGTAAT 480

CCCACGAAGT TTTGAATATT TGTTTTCCTT AATTGATCGT GAAAAAGAAA AGGCTGGAGC 540 

TGGAAAGAGT TTCCTTTGTA AGTGTTCCTT TATTGAAATC TACAACGAGC AGATATATGA 600 

TCTACTGGAC TCTGCATCGG CTGGACTGTA CTTAAGGGAG CATATCAAGA AGGGAGTCTT 660 

TGTTGTTGGT GCGGTGGAGC AGGTGGTAAC CTCAGCTGCT GAAGCCTATC AGGTGCTGTC 720 

TGGAGGATGG AGGAATAGAC GTGTGGCATC AACATCAATG AACAGAGAAT CGTCTAGGTC 780 

TCATGCCGTC TTTACAATTA CAATAGAGTC AATGGAGAAA AGTAATGAGA TTGTGAATAT 840 

ACGGACCTCC CTACTCAACC TGGTGGATTT AGCAGGATCT GAAAGGCAAA AAGATACCCA 900

TGCAGAAGGG ATGAGATTGA AGGAAGCAGG TAACATAAAT CGATCATTGA GCTGCCTGGG 960 

CCAAGTGATT ACAGCACTTG TCGACGTGGG TAATGGAAAA CAGAGACATG TTTGCTACAG 1020 

AGACTCCAAA CTTACCTTCT TACTACGGGA TTCCCTTGGA GGTAATGCCA AAACAGCCAT 1080 

AATTGCAAAT GTTCATCCTG GATCCAGGTG TTTTGGGGAA ACCCTATCAA CACTTAACTT 1140 

TGCTCAAAGA GCCAAGCTGA TTAAAAACAA GGCAGTAGTA AATGAAGACA CCCAAGGAAA 1200 

TGTGAGCCAG CTCCAAGCTG AAGTGAAGAG GCTCAAAGAA CAACTGGCGG AGCTTGCTTC 1260 

AGGACAGACA CCACCAGAAA GCTTCCTGAC CAGAGACAAA AAGAAGACTA ACTATATGGA 1320 

GTATTTCCAG GAAGCAATGT TATTCTTTAA GAAATCTGAA CAGGAAAAGA AGTCTCTGAT 1380 

AGAAAAAGTT ACCCAATTAG AAGACCTCAC CCTCAAAAAG GAAAAATTTA TTCAATCTAA 1440 

TAAAATGATT GTGAAATTCC GAGAGGATCA AATAATACGC TTGGAAAAGC TCCACAAGGA 1500

ATCCCGGGGA GGTTTTCTGC CTGAGGAGCA GGATCGTTTG CTCTCAGAAT TAAGGAATGA 1560

GATTCAAACT CTGCGAGAAC AAATAGAGCA CCACCCCAGA GTTGCAAAGT ATGCTATGGA 1620 

AAATCATTCC CTCAGGGAGG AGAATAGAAG ACTGAGATTA TTAGAGCCTG TGAAAAGAGC 1680 

TCAAGAAATG GATGCCCAGA CCATTGCAAA ACTAGAAAAA GCTTTCTCTG AAATAAGTGG 1740 

CATGGAGAAA AGTGACAAAA ATCAGCAAGG ATTTTCACCT AAAGCTCAGA AAGAGCCATG 1800 

TTTGTTTGCA AACACTGAGA AGTTAAAAGC ACAACTCCTG CAAATTCAGA CAGAGCTGAA 1860 

TAATTCAAAG CAAGAATATG AAGAATTCAA AGAACTTACT AGGAAAAGGC AGCTAGAATT 1920 

GGAATCAGAG CTTCAGTCTT TGCAAAAAGC GAACCTTAAT CTTGAAAACC TTTTGGAAGC 1980 

AACAAAAGCC TGCAAGCGGC AAGAAGTTTC TCAGCTGAAT AAAATTCATG CTGAAACACT 2040 

TAAGATTATA ACTACACCAA CCAAGGCCTA CCAACTTCAT TCCCGACCAG TACCAAAATT 2100 

AAGCCCTGAA ATGGGAAGCT TTGGCTCTCT ATACACTCAG AATTCTAGCA TATTAGATAA 2160 

TGATATATTA AATGAGCCAG TTCCTCCTGA GATGAATGAA CAAGCTTTTG AGGCCATTTC 2220 

TGAAGAGCTT AGAACAGTGC AGGAACAAAT GAGTGCTCTT CAAGCCAAAC TGGATGAAGA 2280 

AGAGCATAAA AACCTAAAGC TTCAGCAGCA TGTTGACAAA CTGGAACATC ATTCTACCCA 2340 

AATGCAGGAG CTTTTCTCAT CAGAAAGAAT TGATTGGACC AAACAGCAGG AAGAGCTTCT 2400 

CTCACAGTTG AATGTCCTTG AAAAGCAGCT TCAAGAGACT CAAACTAAAA ATGACTTTTT 2460 

GAAAAGTGAG GTACATGACC TGCGAGTAGT CCTTCATTCT GCTGACAAGG AGCTTTCTTC 2520 

AGTGAAATTG GAATATAGTT CATTCAAAAC GAATCAGGAG AAAGAATTCA ACAAACTTTC 2580 

TGAAAGACAC ATGCATGTAC AGCTTCAATT AGATAATCTC AGGTTAGAAA ACGAAAAGCT 2640 

GCTTGAGAGC AAAGCCTGCC TACAGGATTC CTATGACAAC TTACAAGAAA TAATGAAATT 2700 

TGAGATTGAC CAACTTTCAA GAAACCTCCA AAACTTCAAA AAAGAAAATG AAACTCTGAA 2760 

ATCTGATCTG AATAATTTGA TGGAGCTTCT TGAGGCAGAA AAAGAACGCA ATAACAAATT 2820 

ATCATTACAG TTTGAAGAAG ATAAAGAAAA CAGTTCTAAA GAAATCTTAA AAGTTCTTGA 2880 

GGCTGTACGT CAGGAGAAAC AGAAAGAGAC GGCCAAGTGT GAGCAGCAGA TGGCAAAAGT 2940 

ACAGAAACTA GAAGAGAGCT TGCTTGCTAC TGAAAAAGTG ATCAGTTCCC TGGAAAAGTC 3000

TAGAGATTCT GATAAGAAAG TTGTAGCTGA CCTCATGAAC CAGATCCAGG AGCTAAGAAC 3060 

ATCGGTCTGT GAGAAAACAG AAACTATAGA CACCCTGAAA CAAGAACTGA AGGACATAAA 3120 

TTGCAAATAC AACTCTGCTT TGGTTGACAG AGAAGAGAGC AGAGTGTTGA TCAAGAAGCA 3180 

GGAAGTGGAT ATTCTGGATC TGAAAGAAAC CCTTAGGCTG AGAATACTTT CTGAGGACAT 3240 

AGAGAGGGAT ATGCTCTGTG AGGACCTGGC TCATGCCACT GAGCAGCTGA ACATGCTCAC 3300 

AGAGGCCTCA AAAAAACACT CGGGGCTGCT GCAGTCTGCC CAGGAAGAAC TGACCAAGAA 3360 

GGAAGCCCTG ATTCAGGAAC TTCAGCACAA GCTAAACCAA AAGAAAGAGG AAGTAGAACA 3420 

GAAGAAGAAT GAATATAACT TCAAAATGAG GCAACTAGAA CATGTGATGG ATTCTGCTGC 3480 

TGAGGATCCC CAGAGTCCTA AGACACCACC TCACTTTCAA ACACATTTGG CAAAACTCCT 3540 

GGAAACACAA GAACAAGAGA TAGAAGATGG AAGAGCCTCT AAGACTTCTT TGGAACACCT 3600 

TGTAACAAAG CTAAATGAAG ACAGAGAAGT CAAAAATGCT GAAATCCTCA GAATGAAGGA 3660 

GCAGTTGCGT GAAATGGAAA ACCTACGCCT GGAAAGTCAG CAGTTAATAG AGAAAAACTG 3720 

GCTCCTGCAA GGTCAGCTGG ATGATATTAA AAGACAAAAG GAAAACAGTG ATCAGAATCA 3780 

TCCAGATAAT CAACAGCTGA AGAATGAACA AGAAGAAAGT ATCAAAGAAA GACTTGCAAA 3840 

AAGTAAAATA GTTGAAGAAA TGCTGAAAAT GAAAGGAGAC CTAGAAGAAG TCCAAAGTGC 3900 

CCTTTACAAC AAAGAGATGG AATGCCTTAG AATGACTGAT GAAGTCGAAC GAACCCAAAC 3960 

TTTGGAGTCT AAAGCATTCC AGGAAAAAGA ACAACTGAGA TCAAAGCTGG AAGAAATGTA 4020 

TGAAGAAAGA GAGAGAACAT CCCAGGAGAT GGAAATGTTA AGGAAGCAGG TGGAGTGTCT 4080 
TGCTGAGGAA AATGGAAAGT TGGTAGGTCA CCAAAATTTG CATCAGAAGA TTCAGTACGT 4140

AGTGCGACTA AAGAAGGAAA ATGTCAGGCT TGCTGAGGAG ACAGAAAAGT TGCGTGCCGA 4200 

AAATGTATTT TTAAAAGAAA AGAAAAGAAG TGAATCTTGA GGATTCCGGT CAGCTACCTA 4260 

GGCATCACCT TGTTTGAAGA TGTTTCTTCT CTTTTACAAG TAAGACCTAC TCCTGGCCAC 4320

TTAGGAGAGC TGAATTTATG GACCTTAATT ATTAAATGTT TATAAGGTGG TGGTAACCAC 4380

CTCAAGTTTC TGATGAACAT TCTGCATCCA TATACACCCT GTGACAGTCA GCAGTCTGCT 4440 

ATTAAGTGGC CTACTTCAAG GCTTTGAATC AACTTAAGGG AAAACCTTTT GTCTTTGTAA 4500 

AAATAAAAGC CTGTAGCTAA GGTTTACAGT GGACATTAGC CAGATCATTT TCTTCTTAGA 4560 

TTATGCCATA ATCTCCTTTG ATTCTTATGG AAGTTCTAAC AATATATGGT GGTTCCAACA 4620 

CCTGCAGTGA GTTTAATGAC TGACTTAGTA GCAGGTACAA GAAGCAAACT TGTTAATATA 4680

GATTATTTTT GTATTCTTAC TTTAGGTATT TTACTTGAGC ATTTTCCATG ACTGTAAATA 4740 
AAGCCATTTT TTAAGATAAA AAAAAAAAAA AAAAA 

Seq ID NO: 164 Protein Sequence
Protein Accession #: NP_064627

1 11 21 31 41 51

| | | | | |

MAPGCKTELR SVTNGQSNQP SNEGDAIKVF VRIRPPAERS GSADGEQNLC LSVLSSTSLR 60

LHSNPEPKTF TFDHVADVDT TQESVFATVA KSIVESCMSG YNGTIFAYGQ TGSGKTFTMM 120

GPSESDNFSH NLRGVIPRSF EYLFSLIDRE KEKAGAGKSF LCKCSFIEIY NEQIYDLLDS 180

ASAGLYLREH IKKGVFVVGA VEQVVTSAAE AYQVLSGGWR NRRVASTSMN RESSRSHAVF 240

TITIESMEKS NEIVNIRTSL LNLVDLAGSE RQKDTHAEGM RLKEAGNINR SLSCLGQVIT 300

ALVDVGNGKQ RHVCYRDSKL TFLLRDSLGG NAKTAIIANV HPGSRCFGET LSTLNFAQRA 360

KLIKNKAVVN EDTQGNVSQL QAEVKRLKEQ LAELASGQTP PESFLTRDKK KTNYMEYFQE 420

AMLFFKKSEQ EKKSLIEKVT QLEDLTLKKE KFIQSNKMIV KFREDQIIRL EKLHKESRGG 480

FLPEEQDRLL SELRNEIQTL REQIEHHPRV AKYAMENHSL REENRRLRLL EPVKRAQEMD 540

AQTIAKLEKA FSEISGMEKS DKNQQGFSPK AQKEPCLFAN TEKLKAQLLQ IQTELNNSKQ 600

EYEEFKELTR KRQLELESEL QSLQKANLNL ENLLEATKAC KRQEVSQLNK IHAETLKIIT 660

TPTKAYQLHS RPVPKLSPEM GSFGSLYTQN SSILDNDILN EPVPPEMNEQ AFEAISEELR 720

TVQEQMSALQ AKLDEEEHKN LKLQQHVDKL EHHSTQMQEL FSSERIDWTK QQEELLSQLN 780

VLEKQLQETQ TKNDFLKSEV HDLRVVLHSA DKELSSVKLE YSSFKTNQEK EFNKLSERHM 840

HVQLQLDNLR LENEKLLESK ACLQDSYDNL QEIMKFEIDQ LSRNLQNFKK ENETLKSDLN 900

NLMELLEAEK ERNNKLSLQF EEDKEKSSKE ILKVLEAVRQ EKQKETAKCE QQMAKVQKLE 960

ESLLATEKVI SSLEKSRDSD KKVVADLMNQ IQELRTSVCE KTETIDTLKQ ELKDINCKYN 1020

SALVDREESR VLIKKQEVDI LDLKETLRLR ILSEDIERDM LCEDLAHATE QLNMLTEASK 1080

KHSGLLQSAQ EELTKKEALI QELQHKLNQK KEEVEQKKNE YNFKMRQLEH VMDSAAEDPQ 1140

SPKTPPHFQT HLAKLLETQE QEIEDGRASK TSLEHLVTKL NEDREVKNAE ILRMKEQLRE 1200

MENLRDESQQ LIEKNWLLQG QLDDIKRQKE NSDQNHPDNQ QLKNEQEESI KERLAKSKIV 1260

EEMLKMKADL EEVQSALYNK EMECLRMTDE VERTQTLESK AFQEKEQLRS KLEEMYEERE 1320

RTSQEMEMLR KQVECLAEEN GKLVGHQNLH QKIQYVVRLK KENVRLAEET EKLRAENVFL 1380
KEKKRSES 

Seq ID NO: 165 DNA Sequence 
Nucleic Acid Accession #: CAT cluster

1 11 21 31 41 51

| | | | | |

TTTTTTTTTT TTTTTTCACA TGCTGATGTT GCTGCCGATG ATTTCAACGC CTGGCTTTGA 60

GATTCCGTGA gtagtcttga ataatttaaa attcgaaaat caaattctac TTATTTTCTC 120

TTAATGCTAT TGTATTTCCT AATTCTCAGC TTTAACATGT AAGAAAGTAC TTTCGCTAGG 180 

GGTCTTAATT GAATGGTGGG GTCGAGATGA CTGCGTCAGA ATTAAATCTC TGGAAGACCT 240 

CTGAGCTCCT TTTAAAATCA TCAACAAGCG AAAATCCTTA TCAATAGCGA TGTGGGAATG 300 

CATTAGGTAC AGTATTTTAA ACATACAAAA CCTAGGCATA TTAAAAAGCA CTCCTCTGGT 360

AATTTAATAA GGAATAATGA TGTCCTTAAG TTTATTTTAA TCAGCAAGTA TGACTCAATT 420

TGAAAATATG AGAACAAATA GATTTAAATA GGAACACCCA GTAAACTATG GTATGCAAAT 480 

AAACTCAGAG GTAAACTTGT GAATACATAA ATCTAAATAA GTCAGTTACC ATCAAAATAT 540 

TACGTGATCC TATATTTTTC TGTCCACGTC TTAAAAGTAT TTTTTACCTT GGGGCTTTAT 600 

TTTCTTTCCC TCTTCCCAAG ATTATCCAAG GCAGTTCCAA TACGCGTTTT CAATAATGGT 660 

AATTAAACTT TTGGAGGGTA ATCGCCAGCT GTTTCCAGAA AAATACTTTT AATAGGTGGT 720

AGTCCTCCCT GTAACTGGGT TTTCCCGGGA ATCCGGGGGG GGCATGTTGT TTTTATTGTC 780 

GCCATTTTTG TTTCGTCTTT TTTGTTTTGT TGGTGGTCAG GTTTTCTTTT CATAGCGGGG 840 

GAGAAGATTT TGGTCGGACT AGCGCCATCC AGCGGGTTTA GAGAAGGAGC ACACTTTTCC 900 
CCGAATAGTG GCTTTTGGAT GAACAATGAA 

Seq ID NO: 166 DNA Sequence
Nucleic Acid Accession #: NM_006953.
Coding sequence: 33..896



1 11 21 31 41 51 

I I I I I I 

CCGTTCCGCG CTCTGGCGGC TCCTCCCGGG CGATGCCTCC GCTCTGGGCC CTGCTGGCCC 60 

TCGGCTGCCT GCGGTTCGGC TCGGCTGTGA ACCTGCAGCC CCAACTGGCC AGTGTGACTT 120 

TCGCCACCAA CAACCCCACA CTTACCACTG TGGCCTTGGA AAAGCCTCTC TGCATGTTTG 180 

ACAGCAAAGA GGCCCTCACT GGCACCCACG AGGTCTACCT GTATGTCCTG GTCGACTCAG 240

CCATTTCCAG GAATGCCTCA GTGCAAGACA GCACCAACAC CCCACTGGGC TCAACGTTCC 300 

TACAAACAGA GGGTGGGAGG ACAGGTCCCT ACAAAGCTGT GGCCTTTGAC CTGATCCCCT 360 

GCAGTGACCT GCCCAGCCTG GATGCCATTG GGGATGTGTC CAAGGCCTCA CAGATCCTGA 420 
ATGCCTACCT GGTCAGGGTG GGTGCCAACG GGACCTGCCT GTGGGATCCC AACTTCCAGG 480

GCCTCTGTAA CGCACCCCTG TCGGCAGCCA CGGAGTACAG GTTCAAGTAT GTCCTGGTCA 540 

ATATGTCCAC GGGCTTGGTA GAGGACCAGA CCCTGTGGTC GGACCCCATC CGCACCAACC 600 

AGCTCACCCC ATACTCGACG ATCGACACGT GGCCAGGCCG GCGGAGCGGA GGCATGATCG 660 

TCATCACTTC CATCCTGGGC TCCCTGCCCT TCTTTCTACT TGTGGGTTTT GCTGGCGCCA 720

TTGCCCTCAG CCTCGTGGAC ATGGGGAGTT CTGATGGGGA AACGACTCAC GACTCCCAAA 780 

TCACTCAGGA GGCTGTTCCC AAGTCGCTGG GGGCCTCGGA GTCTTCCTAC ACGTCCGTGA 840 

ACCGGGGGCC GCCACTGGAC AGGGCTGAGG TGTATTCCAG CAAGCTCCAA GACTGAGCCC 900 

AGCACCACCC CTGGGCAGCA GCATCCTCCT CTCTGGCCTT GCCCCAGGCC CTGCAGCGGT 960 

GGTTGTCACA CCCTGACTTC AGGGAAGGTG AAACAGGGCT TGTCCCTCCA ACTGCAGGAA 1020
AACCCTTAAT AAAATCTTCT GATGAGTTCT AAAAAAAAA 

Seq ID NO: 167 Protein Sequence
Protein Accession #: NP_008884.1

1 11 21 31 41 51

| | | | | |

MPPLWALLAL GCLRFGSAVN LQPQLASVTF ATNNPTLTTV ALEKPLCMFD SKEALTGTHE 60

VYLYVLVDSA ISRNASVQDS TNTPLGSTFL QTEGGRTGPY KAVAFDLIPC SDLPSLDAIG 120

DVSKASQILN AYLVRVGANG TCLWDPNFQG LCNAPLSAAT EYRFKYVLVN MSTGLVEDQT 180

LWSDPIRTNQ LTPYSTIDTW PGRRSGGMIV ITSILGSLPF FLLVGFAGAI ALSLVDMGSS 240 
DGETTHDSQI TQEAVPKSLG ASESSYTSVN RGPPLDRAEV YSSKLQD 

Seq ID NO: 168 DNA Sequence

Nucleic Acid Accession #: NM_005672.1
Coding sequence: 18..389

1 11 21 31 41 51

| | | | | |

AGGGAGAGGC AGTGACCATG AAGGCTGTGC TGCTTGCCCT GTTGATGGCA GGCTTGGCCC 60 

TGCAGCCAGG CACTGCCCTG CTGTGCTACT CCTGCAAAGC CCAGGTGAGC AACGAGGACT 120 

GCCTGCAGGT GGAGAACTGC ACCCAGCTGG GGGAGCAGTG CTGGACCGCG CGCATCCGCG 180 

CAGTTGGCCT CCTGACCGTC ATCAGCAAAG GCTGCAGCTT GAACTGCGTG GATGACTCAC 240 

AGGACTACTA CGTGGGCAAG AAGAACATCA CGTGCTGTGA CACCGACTTG TGCAACGCCA 300

GCGGGGCCCA TGCCCTGCAG CCGGCTGCCG CCATCCTTGC GCTGCTCCCT GCACTCGGCC 360 

TGCTGCTCTG GGGACCCGGC CAGCTATAGG CTCTGGGGGG CCCCGCTGCA GCCCACACTG 420 

GGTGTGGTGC CCCAGGCCTT TGTGCCACTC CTCACAGAAC CTGGCCCAGT GGGAGCCTGT 480 

CCTGGTTCCT GAGGCACATC CTAACGCAAG TTTGACCATG TATGTTTGCA CCCCTTTTCC 540 

CCNAACCCTG ACCTTCCCAT GGGCCTTTTC CAGGATTCCN ACCNGGCAGA TCAGTTTTAG 600

TGANACANAT CCGCNTGCAG ATGGCCCCTC CAACCNTTTN TGTTGNTGTT TCCATGGCCC 660 

AGCATTTTCC ACCCTTAACC CTGTGTTCAG GCACTTNTTC CCCCAGGAAG CCTTCCCTGC 720 

CCACCCCATT TATGAATTGA GCCAGGTTTG GTCCGTGGTG TCCCCCGCAC CCAGCAGGGG 780 
ACAGGCAATC AGGAGGGCCC AGTAAAGGCT GAGATGAAGT GGACTGAGTA GAACTGGAGG 840

ACAAGAGTTG ACGTGAGTTC CTGGGAGTTT CCAGAGATGG GGCCTGGAGG CCTGGAGGAA 900

GGGGCCAGGC CTCACATTTG TGGGGNTCCC GAATGGCAGC CTGAGCACAG CGTAGGCCCT 960 
TAATAAACAC CTGTTGGATA AGCCAAAAAA 

Seq ID NO: 169 Protein Sequence 
Protein Accession #: NP_005663.1

1 11 21 31 41 51 

I I I I I I 

MKAVLLALLM AGLALQPGTA LLCYSCKAQV SNEDCLQVEN CTQLGEQCWT ARIRAVGLLT 60 
VISKGCSLNC VDDSQDYYVG KKNITCCDTD LCNASGAHAL QPAAAILALL PALGLLLWGP 120
GQL 

Seq ID NO: 170 DNA Sequence 
Nucleic Acid Accession #: NM_006952.1
Coding sequence: 11..793

1 11 21 31 41 51

| | | | | |

AATCCCGACA ATGGCGAAAG ACAACTCAAC TGTTCGTTGC TTCCAGGGCC TGCTGATTTT 60

TGGAAATGTG ATTATTGGTT GTTGCGGCAT TGCCCTGACT GCGGAGTGCA TCTTCTTTGT 120

ATCTGACCAA CACAGCCTCT ACCCACTGCT TGAAGCCACC GACAACGATG ACATCTATGG 180 

GGCTGCCTGG ATCGGCATAT TTGTGGGCAT CTGCCTCTTC TGCCTGTCTG TTCTAGGCAT 240 

TGTAGGCATC ATGAAGTCCA GCAGGAAAAT TCTTCTGGCG TATTTCATTC TGATGTTTAT 300 

AGTATATGCC TTTGAAGTGG CATCTTGTAT CACAGCAGCA ACACAACGAG ACTTTTTCAC 360 

ACCCAACCTC TTCCTGAAGC AGATGCTAGA GAGGTACCAA AACAACAGCC CTCCAAACAA 420

TGATGACCAG TGGAAAAACA ATGGAGTCAC CAAAACCTGG GACAGGCTCA TGCTCCAGGA 480 

CAATTGCTGT GGCGTAAATG GTCCATCAGA CTGGCAAAAA TACACATCTG CCTTCCGGAC 540 

TGAGAATAAT GATGCTGACT ATCCCTGGCC TCGTCAATGC TGTGTTATGA ACAATCTTAA 600 

AGAACCTCTC AACCTGGAGG CTTGTAAACT AGGCGTGCCT GGTTTTTATC ACAATCAGGG 660 

CTGCTATGAA CTGATCTCTG GTCCAATGAA CCGACACGCC TGGGGGGTTG CCTGGTTTGG 720

ATTTGCCATT CTCTGCTGGA CTTTTTGGGT TCTCCTGGGT ACCATGTTCT ACTGGAGCAG 780 
AATTGAATAT TAAGAA 

Seq ID NO: 171 Protein Sequence
Protein Accession #: NP_008883.1

1 11 21 31 41 51

| | | | | |

MAKDNSTVRC FQGLLIFGNV IIGCCGIALT AECIFFVSDQ HSLYPLLEAT DNDDIYGAAW 60 

IGIFVGICLF CLSVLGIVGI MKSSRKILLA YFILMFIVYA FEVASCITAA TQRDFFTPNL 120

FLKQMLERYQ NNSPPNNDDQ WKNNGVTKTW DRLMLQDNCC GVNGPSDWQK YTSAFRTENN 180 

DADYPWPRQC CVMNNLKEPL NLEACKLGVP GFYHNQGCYE LISGPMNRHA WGVAWFGFAI 240 
LCWTFWVLLG TMFYWSRIEY 

Seq ID NO: 172 DNA Sequence 

Nucleic Acid Accession #: Eos sequence 

Coding sequence: 1..672

1 11 21 31 41 51

| | | | | |

ATGAGGCTCC AAAGACCCCG ACAGGCCCCG GCGGGTGGGA GGCGCGCGCC CCGGGGCGGG 60 

CGGGGCTCCC CCTACCGGCC AGACCCGGGG AGAGGCGCGC GGAGGCTGCG AAGGTTCCAG 120 

AAGGGCGGGG AGGGGGCGCC GCGCGCTGAC CCTCCCTGGG CACCGCTGGG GACGATGGCG 180 

CTGCTCGCCT TGCTGCTGGT CGTGGCCCTA CCGCGGGTGT GGACAGACGC CAACCTGACT 240 

GCGAGACAAC GAGATCCAGA GGACTCCCAG CGAACGGACG AGGGTGACAA TAGAGTGTGG 300 

TGTCATGTTT GTGAGAGAGA AAACACTTTC GAGTGCCAGA ACCCAAGGAG GTGCAAATGG 360 

ACAGAGCCAT ACTGCGTTAT AGCGGCCGTG AAAATATTTC CACGTTTTTT CATGGTTGCG 420 

AAGCAGTGCT CCGCTGGTTG TGCAGCGATG GAGAGACCCA AGCCAGAGGA GAAGCGGTTT 480

CTCCTGGAAG AGCCCATGCC CTTCTTTTAC CTCAAGTGTT GTAAAATTCG CTACTGCAAT 540 

TTAGAGGGGC CACCTATCAA CTCATCAGTG TTCAAAGAAT ATGCTGGGAG CATGGGTGAG 600 

AGCTGTGGTG GGCTGTGGCT GGCCATCCTC CTGCTGCTGG CCTCCATTGC AGCCGGCCTC 660
AGCCTGTCTT GA 



Seq ID NO: 173 Protein Sequence 
Protein Accession #: Eos sequence

1 11 21 31 41 51



MRLQRPRQAP AGGRRAPRGG RGSPYRPDPG RGARRLRRFQ KGGEGAPRAD PPWAPLGTMA 60

LLALLLVVAL PRVWTDANLT ARQRDPEDSQ RTDEGDNRVW CHVCERENTF ECQNPRRCKW 120

TEPYCVIAAV KIFPRFFMVA KQCSAGCAAM ERPKPEEKRF LLEEPMPFFY LKCCKIRYCN 180 
LEGPPINSSV FKEYAGSMGE SCGGLWLAIL LLLASIAAGL SLS

Seq ID NO: 174 DNA Sequence
Nucleic Acid Accession #: XM_057014
Coding sequence: 143..874

1 11 21 31 41 51

| | | | | |
GGGAGGGAGA GAGGCGCGCG GGTGAAAGGC GCATTGATGC AGCCTGCGGC GGCCTCGGAG 60 
CGCGGCGGAG CCAGACGCTG ACCACGTTCC TCTCCTCGGT CTCCTCCGCC TCCAGCTCCG 120 
CGCTGCCCGG CAGCCGGGAG CCATGCGACC CCAGGGCCCC GCCGCCTCCC CGCAGCGGCT 180 
CCGCGGCCTC CTGCTGCTCC TGCTGCTGCA GCTGCCCGCG CCGTCGAGCG CCTCTGAGAT 240 
CCCCAAGGGG AAGCAAAAGG CGCAGCTCCG GCAGAGGGAG GTGGTGGACC TGTATAATGG 300 
AATGTGCTTA CAAGGGCCAG CAGGAGTGCC TGGTCGAGAC GGGAGCCCTG GGGCCAATGG 360 
CATTCCGGGT ACACCTGGGA TCCCAGGTCG GGATGGATTC AAAGGAGAAA AGGGGGAATG 420 
TCTGAGGGAA AGCTTTGAGG AGTCCTGGAC ACCCAACTAC AAGCAGTGTT CATGGAGTTC 480 
ATTGAATTAT GGCATAGATC TTGGGAAAAT TGCGGAGTGT ACATTTACAA AGATGCGTTC 540 
AAATAGTGCT CTAAGAGTTT TGTTCAGTGG CTCACTTCGG CTAAAATGCA GAAATGCATG 600 
CTGTCAGCGT TGGTATTTCA CATTCAATGG AGCTGAATGT TCAGGACCTC TTCCCATTGA 660 
AGCTATAATT TATTTGGACC AAGGAAGCCC TGAAATGAAT TCAACAATTA ATATTCATCG 720 
CACTTCTTCT GTGGAAGGAC TTTGTGAAGG AATTGGTGCT GGATTAGTGG ATGTTGCTAT 780 
CTGGGTTGGC ACTTGTTCAG ATTACCCAAA AGGAGATGCT TCTACTGGAT GGAATTCAGT 840 
TTCTCGCATC ATTATTGAAG AACTACCAAA ATAAATGCTT TAATTTTCAT TTGCTACCTC 900 
TTTTTTTATT ATGCCTTGGA ATGGTTCACT TAAATGACAT TTTAAATAAG TTTATGTATA 960 
CATCTGAATG AAAAGCAAAG CTAAATATGT TTACAGACCA AAGTGTGATT TCACACTGTT 1020 
TTTAAATCTA GCATTATTCA TTTTGCTTCA ATCAAAAGTG GTTTCAATAT TTTTTTTAGT 1080 
TGGTTAGAAT ACTTTCTTCA TAGTCACATT CTCTCAACCT ATAATTTGGA ATATTGTTGT 1140
GGTCTTTTGT TTTTTCTCTT AGTATAGCAT TTTTAAAAAA ATATAAAAGC TACCAATCTT 1200 
TGTACAATTT GTAAATGTTA AGAATTTTTT TTATATCTGT TAAATAAAAA TTATTTCCAA 1260
CAACCTTAAA AAAAAAAAAA AAAA 

Seq ID NO: 175 Protein Sequence 
Protein Accession #: XP_057014

1 11 21 31 41 51

MRPQGPAASP QRLRGLLLLL LLQLPAPSSA SEIPKGKQKA QLRQREVVDL YNGMCLQGPA 60
GVPGRDGSPG ANGIPGTPGI PGRDGFKGEK GECLRESFEE SWTPNYKQCS WSSLNYGIDL 120
GKIAECTFTK MRSNSALRVL FSGSLRLKCR NACCQRWYFT FNGAECSGPL PIEAIIYLDQ 180
GSPEMNSTIN IHRTSSVEGL CEGIGAGLVD VAIWVGTCSD YPKGDASTGW NSVSRIIIEE 240
LPK

Seq ID NO: 176 DNA Sequence 
Nucleic Acid Accession #: XM_084007
Coding sequence: 1..2268

1 11 21 31 41 51

| | | | | |

ATGGCGAGGA AGTTATCTGT AATCTTGATC CTGACCTTTG CCCTCTCTGT CACAAATCCC 60

CTTCATGAAC TAAAAGCAGC TGCTTTCCCC CAGACCACTG AGAAAATTAG TCCGAATTGG 120 

GAATCTGGCA TTAATGTTGA CTTGGCAATT TCCACACGGC AATATCATCT ACAACAGCTT 180 

TTCTACCGCT ATGGAGAAAA TAATTCTTTG TCAGTTGAAG GGTTCAGAAA ATTACTTCAA 240 

AATATAGGCA TAGATAAGAT TAAAAGAATC CATATACACC ATGACCACGA CCATCACTCA 300 

GACCACGAGC ATCACTCAGA CCATGAGCGT CACTCAGACC ATGAGCATCA CTCAGACCAC 360

GAGCATCACT CTGACCATGA TCATCACTCC CACCATAATC ATGCTGCTTC TGGTAAAAAT 420 

AAGCGAAAAG CTCTTTGCCC AGACCATGAC TCAGATAGTT CAGGTAAAGA TCCTAGAAAC 480 

AGCCAGGGGA AAGGAGCTCA CCGACCAGAA CATGCCAGTG GTAGAAGGAA TGTCAAGGAC 540 

AGTGTTAGTG CTAGTGAAGT GACCTCAACT GTGTACAACA CTGTCTCTGA AGGAACTCAC 600 

TTTCTAGAGA CAATAGAGAC TCCAAGACCT GGAAAACTCT TCCCCAAAGA TGTAAGCAGC 660

TCCACTCCAC CCAGTGTCAC ATCAAAGAGC CGGGTGAGCC GGCTGGCTGG TAGGAAAACA 720 

AATGAATCTG TGAGTGAGCC CCGAAAAGGC TTTATGTATT CCAGAAACAC AAATGAAAAT 780 

CCTCAGGAGT GTTTCAATGC ATCAAAGCTA CTGACATCTC ATGGCATGGG CATCCAGGTT 840 

CCGCTGAATG CAACAGAGTT CAACTATCTC TGTCCAGCCA TCATCAACCA AATTGATGCT 900 

AGATCTTGTC TGATTCATAC AAGTGAAAAG AAGGCTGAAA TCCCTCCAAA GACCTATTCA 960

TTACAAATAG CCTGGGTTGG TGGTTTTATA GCCATTTCCA TCATCAGTTT CCTGTCTCTG 1020 

CTGGGGGTTA TCTTAGTGCC TCTCATGAAT CGGGTGTTTT TCAAATTTCT CCTGAGTTTC 1080 

CTTGTGGCAC TGGCCGTTGG GACTTTGAGT GGTGATGCTT TTTTACACCT TCTTCCACAT 1140 

TCTCATGCAA GTCACCACCA TAGTCATAGC CATGAAGAAC CAGCAATGGA AATGAAAAGA 1200 

GGACCACTTT TCAGTCATCT GTCTTCTCAA AACATAGAAG AAAGTGCCTA TTTTGATTCC 1260

ACGTGGAAGG GTCTAACAGC TCTAGGAGGC CTGTATTTCA TGTTTCTTGT TGAACATGTC 1320 

CTCACATTGA TCAAACAATT TAAAGATAAG AAGAAAAAGA ATCAGAAGAA ACCTGAAAAT 1380 

GATGATGATG TGGAGATTAA GAAGCAGTTG TCCAAGTATG AATCTCAACT TTCAACAAAT 1440 

GAGGAGAAAG TAGATACAGA TGATCGAACT GAAGGCTATT TACGAGCAGA CTCACAAGAG 1500 

CCCTCCCACT TTGATTCTCA GCAGCCTGCA GTCTTGGAAG AAGAAGAGGT CATGATAGCT 1560

CATGCTCATC CACAGGAAGT CTACAATGAA TATGTACCCA GAGGGTGCAA GAATAAATGC 1620

CATTCACATT TCCACGATAC ACTCGGCCAG TCAGACGATC TCATTCACCA CCATCATGAC 1680 

TACCATCATA TTCTCCATCA TCACCACCAC CAAAACCACC ATCCTCACAG TCACAGCCAG 1740 

CGCTACTCTC GGGAGGAGCT GAAAGATGCC GGCGTCGCCA CTTTGGCCTG GATGGTGATA 1800 

ATGGGTGATG GCCTGCACAA TTTCAGCGAT GGCCTAGCAA TTGGTGCTGC TTTTACTGAA 1860

GGCTTATCAA GTGGTTTAAG TACTTCTGTT GCTGTGTTCT GTCATGAGTT GCCTCATGAA 1920

TTAGGTGACT TTGCTGTTCT ACTAAAGGCT GGCATGACCG TTAAGCAGGC TGTCCTTTAT 1980 

AATGCATTGT CAGCCATGCT GGCGTATCTT GGAATGGCAA CAGGAATTTT CATTGGTCAT 2040 

TATGCTGAAA ATGTTTCTAT GTGGATATTT GCACTTACTG CTGGCTTATT CATGTATGTT 2100 

GCTCTGGTTG ATATGGTACC TGAAATGCTG CACAATGATG CTAGTGACCA TGGATGTAGC 2160

CGCTGGGGGT ATTTCTTTTT ACAGAATGCT GGGATGCTTT TGGGTTTTGG AATTATGTTA 2220 
CTTATTTCCA TATTTGAACA TAAAATCGTG TTTCGTATAA ATTTCTAG 

Seq ID NO: 177 Protein Sequence 
Protein Accession #: XP_084007

1 11 21 31 41 51

| | | | | |

MARKLSVILI LTFALSVTNP LHELKAAAFP QTTEKISPNW ESGINVDLAI STRQYHLQQL 60

FYRYGENNSL SVEGFRKLLQ NIGIDKIKRI HIHHDHDHHS DHEHHSDHER HSDHEHHSDH 120

EHHSDHDHHS HHNHAASGKN KRKALCPDHD SDSSGKDPRN SQGKGAHRPE HASGRRNVKD 180 

SVSASEVTST VYNTVSEGTH FLETIETPRP GKLFPKDVSS STPPSVTSKS RVSRLAGRKT 240 

NESVSEPRKG FMYSRNTNEN PQECFNASKL LTSHGMGIQV PLNATEFNYL CPAIINQIDA 300 

RSCLIHTSEK KAEIPPKTYS LQIAWVGGFI AISIISFLSL LGVILVPLMN RVFFKFLLSF 360 

LVALAVGTLS GDAFLHLLPH SHASHHHSHS HEEPAMEMKR GPLFSHLSSQ NIEESAYFDS 420

TWKGLTALGG LYFMFLVEHV LTLIKQFKDK KKKNQKKPEN DDDVEIKKQL SKYESQLSTN 480 

EEKVDTDDRT EGYLRADSQE PSHFDSQQPA VLEEEEVMIA HAHPQEVYNE YVPRGCKNKC 540 

HSHFHDTLGQ SDDLIHHHHD YHHILHHHHH QNHHPHSHSQ RYSREELKDA GVATLAWMVI 600 

MGDGLHNFSD GLAIGAAFTE GLSSGLSTSV AVFCHELPHE LGDFAVLLKA GMTVKQAVLY 660 

NALSAMLAYL GMATGIFIGH YAENVSMWIF ALTAGLFMYV ALVDMVPEML HNDASDHGCS 720
RWGYFFLQNA GMLLGFGIML LISIFEHKIV FRINF 

Seq ID NO: 178 DNA Sequence
Nucleic Acid Accession #: BC010423
Coding sequence: 248..1780

1 11 21 31 41 51

| | | | | |

CACAGCGTGG GAAGCAGCTC TGGGGGAGCT CGGAGCTCCC GATCACGGCT TCTTGGGGGT 60

AGCTACGGCT GGGTGTGTAG AACGGGGCCG GGGCTGGGGC TGGGTCCCCT AGTGGAGACC 120

CAAGTGCGAG AGGCAAGAAC TCTGCAGCTT CCTGCCTTCT GGGTCAGTTC CTTATTCAAG 180 

TCTGCAGCCG GCTCCCAGGG AGATCTCGGT GGAACTTCAG AAACGCTGGG CAGTCTGCCT 240 

TTCAACCATG CCCCTGTCCC TGGGAGCCGA GATGTGGGGG CCTGAGGCCT GGCTGCTGCT 300 
GCTGCTACTG CTGGCATCAT TTACAGGCCG GTGCCCCGCG GGTGAGCTGG AGACCTCAGA 360
CGTGGTAACT GTGGTGCTGG GCCAGGACGC AAAACTGCCC TGCTTCTACC GAGGGGACTC 420
CGGCGAGCAA GTGGGGCAAG TGGCATGGGC TCGGGTGGAC GCGGGCGAAG GCGCCCAGGA 480
ACTAGCGCTA CTGCACTCCA AATACGGGCT TCATGTGAGC CCGGCTTACG AGGGCCGCGT 540
GGAGCAGCCG CCGCCCCCAC GCAACCCCCT GGACGGCTCA GTGCTCCTGC GCAACGCAGT 600
GCAGGCGGAT GAGGGCGAGT ACGAGTGCCG GGTCAGCACC TTCCCCGCCG GCAGCTTCCA 660
GGCGCGGCTG CGGCTCCGAG TGCTGGTGCC TCCCCTGCCC TCACTGAATC CTGGTCCAGC 720
ACTAGAAGAG GGCCAGGGCC TGACCCTGGC AGCCTCCTGC ACAGCTGAGG GCAGCCCAGC 780
CCCCAGCGTG ACCTGGGACA CGGAGGTCAA AGGCACAACG TCCAGCCGTT CCTTCAAGCA 840
CTCCCGCTCT GCTGCCGTCA CCTCAGAGTT CCACTTGGTG CCTAGCCGCA GCATGAATGG 900
GCAGCCACTG ACTTGTGTGG TGTCCCATCC TGGCCTGCTC CAGGACCAAA GGATCACCCA 960
CATCCTCCAC GTGTCCTTCC TTGCTGAGGC CTCTGTGAGG GGCCTTGAAG ACCAAAATCT 1020
GTGGCACATT GGCAGAGAAG GAGCTATGCT CAAGTGCCTG AGTGAAGGGC AGCCCCCTCC 1080
CTCATACAAC TGGACACGGC TGGATGGGCC TCTGCCCAGT GGGGTACGAG TGGATGGGGA 1140
CACTTTGGGC TTTCCCCCAC TGACCACTGA GCACAGCGGC ATCTACGTCT GCCATGTCAG 1200
CAATGAGTTC TCCTCAAGGG ATTCTCAGGT CACTGTGGAT GTTCTTGACC CCCAGGAAGA 1260
CTCTGGGAAG CAGGTGGACC TAGTGTCAGC CTCGGTGGTG GTGGTGGGTG TGATCGCCGC 1320
ACTCTTGTTC TGCCTTCTGG TGGTGGTGGT GGTGCTCATG TCCCGATACC ATCGGCGCAA 1380
GGCCCAGCAG ATGACCCAGA AATATGAGGA GGAGCTGACC CTGACCAGGG AGAACTCCAT 1440
CCGGAGGCTG CATTCCCATC ACACGGACCC CAGGAGCCAG CCGGAGGAGA GTGTAGGGCT 1500
GAGAGCCGAG GGCCACCCTG ATAGTCTCAA GGACAACAGT AGCTGCTCTG TGATGAGTGA 1560
AGAGCCCGAG GGCCGCAGTT ACTCCACGCT GACCACGGTG AGGGAGATAG AAACACAGAC 1620
TGAACTGCTG TCTCCAGGCT CTGGGCGGGC CGAGGAGGAG GAAGATCAGG ATGAAGGCAT 1680
CAAACAGGCC ATGAACCATT TTGTTCAGGA GAATGGGACC CTACGGGCCA AGCCCACGGG 1740
CAATGGCATC TACATCAATG GGCGGGGACA CCTGGTCTGA CCCAGGCCTG CCTCCCTTCC 1800
CTAGGCCTGG CTCCTTCTGT TGACATGGGA GATTTTAGCT CATCTTGGGG GCCTCCTTAA 1860
ACACCCCCAT TTCTTGCGGA AGATGCTCCC CATCCCACTG ACTGCTTGAC CTTTACCTCC 1920
AACCCTTCTG TTCATCGGGA GGGCTCCACC AATTGAGTCT CTCCCACCAT GCATGCAGGT 1980
CACTGTGTGT GTGCATGTGT GCCTGTGTGA GTGTTGACTG ACTGTGTGTG TGTGGAGGGG 2040
TGACTGTCCG TGGAGGGGTG ACTGTGTCCG TGGTGTGTAT TATGCTGTCA TATCAGAGTC 2100
AAGTGAACTG TGGTGTATGT GCCACGGGAT TTGAGTGGTT GCGTGGGCAA CACTGTCAGG 2160
GTTTGGCGTG TGTGTCATGT GGCTGTGTGT GACCTCTGCC TGAAAAAGCA GGTATTTTCT 2220
CAGACCCCAG AGCAGTATTA ATGATGCAGA GGTTGGAGGA GAGAGGTGGA GACTGTGGCT 2280
CAGACCCAGG TGTGCGGGCA TAGCTGGAGC TGGAATCTGC CTCCGGTGTG AGGGAACCTG 2340
TCTCCTACCA CTTCGGAGCC ATGGGGGCAA GTGTGAAGCA GCCAGTCCCT GGGTCAGCCA 2400
GAGGCTTGAA CTGTTACAGA AGCCCTCTGC CCTCTGGTGG CCTCTGGGCC TGCTGCATGT 2460
ACATATTTTC TGTAAATATA CATGCGCCGG GAGCTTCTTG CAGGAATACT GCTCCGAATC 2520
ACTTTTAATT TTTTTCTTTT TTTTTTCTTG CCCTTTCCAT TAGTTGTATT TTTTATTTAT 2580
TTTTATTTTT ATTTTTTTTT AGAGTTTGAG TCCAGCCTGG ACGATATAGC CAGACCCTGT 2640
CTGTAAAAAA ACCAAAACCC AAAAAAAAAA AAAAAAAAAA



Seq ID NO: 179 Protein Sequence 
Protein Accession #: AAH10423 



1 11 21 31 41 51

| | | | | |

MPLSLGAEMW GPEAWLLLLL LLASFTGRCP AGELETSDVV TVVLGQDAKL PCFYRGDSGE 60
QVGQVAWARV DAGEGAQELA LLHSKYGLHV SPAYEGRVEQ PPPPRNPLDG SVLLRNAVQA 120
DEGEYECRVS TFPAGSFQAR LRLRVLVPPL PSLNPGPALE EGQGLTLAAS CTAEGSPAPS 180
VTWDTEVKGT TSSRSFKHSR SAAVTSEFHL VPSRSMNGQP LTCVVSHPGL LQDQRITHIL 240
HVSFLAEASV RGLEDQNLWH IGREGAMLKC LSEGQPPPSY NWTRLDGPLP SGVRVDGDTL 300
GFPPLTTEHS GIYVCHVSNE FSSRDSQVTV DVLDPQEDSG KQVDLVSASV VVVGVIAALL 360
FCLLVVVVVL MSRYHRRKAQ QMTQKYEEEL TLTRENSIRR LHSHHTDPRS QPEESVGLRA 420
EGHPDSLKDN SSCSVMSEEP EGRSYSTLTT VREIETQTEL LSPGSGRAEE EEDQDEGIKQ 480
AMNHFVQENG TLRAKPTGNG IYINGRGHLV



Seq ID NO: 180 DNA Sequence 

Nucleic Acid Accession #: Eos sequence 

Coding sequence: 482..3007

1 11 21 31 41 51

| | | | | |

AACTGAGCTA ACAAGAAATA CTAGAAAAGG AGGAAGGAGA ACATTGCTGC AGCTTGGATC 60
TACAACCTAA GAAAGCAAGA GTGATCAATC TCAGCTCTGT TAAACATCTT GTTTACTTAC 120
TGCATTCAGC AGCTTGCAAA TGGTTAACTA TATGCAAAAA AGTCAGCATA GCTGTGAAGT 180
ATGCCGTGAA TTTTAATTGA GGGAAAAAGG GACAATTGCT TCAGGATGCT CTAGTATGCA 240
CTCTGCTTGA AATATTTTCA ATGAAATGCT CAGTATTCTA TCTTTGACCA GAGGTTTTAA 300
CTTTATGAAG CTATGGGACT TGACAAAAAG TGATATTTGA GAAGAAAGTA CGCAGTGGTT 360
GGTGTTTTCT TTTTTTTAAT AAAGGAATTG AATTACTTTG AACACCTCTT CCAGCTGTGC 420
ATTACAGATA ACGTCAGGAA GAGTCTCTGC TTTACAGAAT CGGATTTCAT CACATGACAA 480
CATGAAGCTG TGGATTCATC TCTTTTATTC ATCTCTCCTT GCCTGTATAT CTTTACACTC 540
CCAAACTCCA GTGCTCTCAT CCAGAGGCTC TTGTGATTCT CTTTGCAATT GTGAGGAAAA 600
AGATGGCACA ATGCTAATAA ATTGTGAAGC AAAAGGTATC AAGATGGTAT CTGAAATAAG 660
TGTGCCACCA TCACGACCTT TCCAACTAAG CTTATTAAAT AACGGCTTGA CGATGCTTCA 720
CACAAATGAC TTTTCTGGGC TTACCAATGC TATTTCAATA CACCTTGGAT TTAACAATAT 780
TGCAGATATT GAGATAGGTG CATTTAATGG CCTTGGCCTC CTGAAACAAC TTCATATCAA 840
TCACAATTCT TTAGAAATTC TTAAAGAGGA TACTTTCCAT GGACTGGAAA ACCTGGAATT 900
CCTGCAAGCA GATAACAATT TTATCACAGT GATTGAACCA AGTGCCTTTA GCAAGCTCAA 960
CAGACTCAAA GTGTTAATTT TAAATGACAA TGCTATTGAG AGTCTTCCTC CAAACATCTT 1020

CCGATTTGTT CCTTTAACCC ATCTAGATCT TCGTGGAAAT CAATTACAAA CATTGCCTTA 1080 

TGTTGGTTTT CTCGAACACA TTGGCCGAAT ATTGGATCTT CAGTTGGAGG ACAACAAATG 1140 

GGCCTGCAAT TGTGACTTAT TGCAGTTAAA AACTTGGTTG GAGAACATGC CTCCACAGTC 1200 

TATAATTGGT GATGTTGTCT GCAACAGCCC TCCATTTTTT AAAGGAAGTA TACTCAGTAG 1260

ACTAAAGAAG GAATCTATTT GCCCTACTCC ACCAGTGTAT GAAGAACATG AGGATCCTTC 1320 

AGGATCATTA CATCTGGCAG CAACATCTTC AATAAATGAT AGTCGCATGT CAACTAAGAC 1380 

CACGTCCATT CTAAAACTAC CCACCAAAGC ACCAGGTTTG ATACCTTATA TTACAAAGCC 1440 

ATCCACTCAA CTTCCAGGAC CTTACTGCCC TATTCCTTGT AACTGCAAAG TCCTATCCCC 1500 

ATCAGGACTT CTAATACATT GTCAGGAGCG CAACATTGAA AGCTTATCAG ATCTGAGACC 1560

TCCTCCGCAA AATCCTAGAA AGCTCATTCT AGCGGGAAAT ATTATTCACA GTTTAATGAA 1620 

GTCTGATCTA GTGGAATATT TCACTTTGGA AATGCTTCAC TTGGGAAACA ATCGTATTGA 1680 

AGTTCTTGAA GAAGGATCGT TTATGAACCT AACGAGATTA CAAAAACTCT ATCTAAATGG 1740 

TAACCACCTG ACCAAATTAA GTAAAGGCAT GTTCCTTGGT CTCCATAATC TTGAATACTT 1800 

ATATCTTGAA TACAATGCCA TTAAGGAAAT ACTGCCAGGA ACCTTTAATC CAATGCCTAA 1860
ACTTAAAGTC CTGTATTTAA ATAACAACCT CCTCCAAGTT TTACCACCAC ATATTTTTTC 1920

AGGGGTTCCT CTAACTAAGG TAAATCTTAA AACAAACCAG TTTACCCATC TACCTGTAAG 1980 

TAATATTTTG GATGATCTTG ATTTACTAAC CCAGATTGAC CTTGAGGATA ACCCCTGGGA 2040 

CTGCTCCTGT GACCTGGTTG GACTGCAGCA ATGGATACAA AAGTTAAGCA AGAACACAGT 2100 

GACAGATGAC ATCCTCTGCA CTTCCCCCGG GCATCTCGAC AAAAAGGAAT TGAAAGCCCT 2160

AAATAGTGAA ATTCTCTGTC CAGGTTTAGT AAATAACCCA TCCATGCCAA CACAGACTAG 2220 

TTACCTTATG GTCACCACTC CTGCAACAAC AACAAATACG GCTGATACTA TTTTACGATC 2280

TCTTACGGAC GCTGTGCCAC TGTCTGTTCT AATATTGGGA CTTCTGATTA TGTTCATCAC 2340 

TATTGTTTTC TGTGCTGCAG GGATAGTGGT TCTTGTTCTT CACCGCAGGA GAAGATACAA 2400 

AAAGAAACAA GTAGATGAGC AAATGAGAGA CAACAGTCCT GTGCATCTTC AGTACAGCAT 2460

GTATGGCCAT AAAACCACTC ATCACACTAC TGAAAGACCC TCTGCCTCAC TCTATGAACA 2520

GCACATGGTG AGCCCCATGG TTCATGTCTA TAGAAGTCCA TCCTTTGGTC CAAAGCATCT 2580 

GGAAGAGGAA GAAGAGAGGA ATGAGAAAGA AGGAAGTGAT GCAAAACATC TCCAAAGAAG 2640 

TCTTTTGGAA CAGGAAAATC ATTCACCACT CACAGGGTCA AATATGAAAT ACAAAACCAC 2700 

GAACCAATCA ACAGAATTTT TATCCTTCCA AGATGCCAGC TCATTGTACA GAAACATTTT 2760

AGAAAAAGAA AGGGAACTTC AGCAACTGGG AATCACAGAA TACCTAAGGA AAAACATTGC 2820 

TCAGCTCCAG CCTGATATGG AGGCACATTA TCCTGGAGCC CACGAAGAGC TGAAGTTAAT 2880 

GGAAACATTA ATGTACTCAC GTCCAAGGAA GGTATTAGTG GAACAGACAA AAAATGAGTA 2940 

TTTTGAACTT AAAGCTAATT TACATGCTGA ACCTGACTAT TTAGAAGTCC TGGAGCAGCA 3000 

AACATAGATG GAGAGTTTGA GGGCTTTCGC AGAAATGCTG TGATTCTGTT TTAAGTCCAT 3060

ACCTTGTAAA TAAGTGCCTT ACGTGAGTGT GTCATCAATC AGAACCTAAG CACAGCAGTA 3120 

AACTATGGGG AAAAAAAAAG AAGAAGAAAA GAAACTCAGG GATCACTGGG AGAAGCCATG 3180 

GCATTATCTT CAGGCAATTT AGTCTGTCCC AAATAAAATC AATCCTTGCA TGTAAATC 3238 

Seq ID NO: 181 Protein Sequence

Protein Accession #: Eos sequence

1 11 21 31 41 51

| | | | | |

MKLWIHLFYS SLLACISLHS QTPVLSSRGS CDSLCNCEEK DGTMLINCEA KGIKMVSEIS 60

VPPSRPFQLS LLNNGLTMLH TNDFSGLTNA ISIHLGFNNI ADIEIGAFNG LGLLKQLHIN 120

HNSLEILKED TFHGLENLEF LQADNNFITV IEPSAFSKLN RLKVLILNDN AIESLPPNIF 180

RFVPLTHLDL RGNQLQTLPY VGFLEHIGRI LDLQLEDNKW ACNCDLLQLK TWLENMPPQS 240 

IIGDVVCNSP PFFKGSILSR LKKESICPTP PVYEEHEDPS GSLHLAATSS INDSRMSTKT 300

TSILKLPTKA PGLIPYITKP STQLPGPYCP IPCNCKVLSP SGLLIHCQER NIESLSDLRP 360

PPQNPRKLIL AGNIIHSLMK SDLVEYFTLE MLHLGNNRIE VLEEGSFMNL TRLQKLYLNG 420

NHLTKLSKGM FLGLHNLEYL YLEYNAIKEI LPGTFNPMPK LKVLYLNNNL LQVLPPHIFS 480

GVPLTKVNLK TNQFTHLPVS NILDDLDLLT QIDLEDNPWD CSCDLVGLQQ WIQKLSKNTV 540 

TDDILCTSPG HLDKKELKAL NSEILCPGLV NNPSMPTQTS YLMVTTPATT TNTADTILRS 600 

LTDAVPLSVL ILGLLIMFIT IVFCAAGIVV LVLHRRRRYK KKQVDEQMRD NSPVHLQYSM 660

YGHKTTHHTT ERPSASLYEQ HMVSPMVHVY RSPSFGPKHL EEEEERNEKE GSDAKHLQRS 720 

LLEQENHSPL TGSNMKYKTT NQSTEFLSFQ DASSLYRNIL EKERELQQLG ITEYLRKNIA 780 

QLQPDMEAHY PGAHEELKLM ETLMYSRPRK VLVEQTKNEY FELKANLHAE PDYLEVLEQQ 840 
T 






Seq ID NO: 182 DNA Sequence 

Nucleic Acid Accession #: XM_035292.2

Coding sequence: 1..1524

1 11 21 31 41 51

| | | | | |

ATGGCGGGTG CGGGCCCGAA GCGGCGCGCG CTAGCGGCGC CGGCGGCCGA GGAGAAGGAA 60 

GAGGCGCGGG AGAAGATGCT GGCCGCCAAG AGCGCGGACG GCTCGGCGCC GGCAGGCGAG 120 

GGCGAGGGCG TGACCCTGCA GCGGAACATC ACGCTGCTCA ACGGCGTGGC CATCATCGTG 180

GGGACCATTA TCGGCTCGGG CATCTTCGTG ACGCCCACGG GCGTGCTCAA GGAGGCAGGC 240 

TCGCCGGGGC TGGCGCTGGT GGTGTGGGCC GCGTGCGGCG TCTTCTCCAT CGTGGGCGCG 300

CTCTGCTACG CGGAGCTCGG CACCACCATC TCCAAATCGG GCGGCGACTA CGCCTACATG 360 

CTGGAGGTCT ACGGCTCGCT GCCCGCCTTC CTCAAGCTCT GGATCGAGCT GCTCATCATC 420 

CGGCCTTCAT CGCAGTACAT CGTGGCCCTG GTCTTCGCCA CCTACCTGCT CAAGCCGCTC 480

TTCCCCACCT GCCCGGTGCC CGAGGAGGCA GCCAAGCTCG TGGCCTGCCT CTGCGTGCTG 540 

CTGCTCACGG CCGTGAACTG CTACAGCGTG AAGGCCGCCA CCCGGGTCCA GGATGCCTTT 600 

GCCGCCGCCA AGCTCCTGGC CCTGGCCCTG ATCATCCTGC TGGGCTTCGT CCAGATCGGA 660 
AAGGGTGATG TGTCCAATCT AGATCCCAAC TTCTCATTTG AAGGCACCAA ACTGGATGTG 720
GGGAACATTG TGCTGGCATT ATACAGCGGC CTCTTTGCCT ATGGAGGATG GAATTACTTG 780
AATTTCGTCA CAGAGGAAAT GATCAACCCC TACAGAAACC TGCCCCTGGC CATCATCATC 840
TCCCTGCCCA TCGTGACGCT GGTGTACGTG CTGACCAACC TGGCCTACTT CACCACCCTG 900
TCCACCGAGC AGATGCTGTC GTCCGAGGCC GTGGCCGTGG ACTTCGGGAA CTATCACCTG 960
GGCGTCATGT CCTGGATCAT CCCCGTCTTC GTGGGCCTGT CCTGCTTCGG CTCCGTCAAT 1020
GGGTCCCTGT TCACATCCTC CAGGCTCTTC TTCGTGGGGT CCCGGGAAGG CCACCTGCCC 1080
TCCATCCTCT CCATGATCCA CCCACAGCTC CTCACCCCCG TGCCGTCCCT CGTGTTCACG 1140
TGTGTGATGA CGCTGCTCTA CGCCTTCTCC AAGGACATCT TCTCCGTCAT CAACTTCTTC 1200
AGCTTCTTCA ACTGGCTCTG CGTGGCCCTG GCCATCATCG GCATGATCTG GCTGCGCCAC 1260
AGAAAGCCTG AGCTTGAGCG GCCCATCAAG GTGAACCTGG CCCTGCCTGT GTTCTTCATC 1320
CTGGCCTGCC TCTTCCTGAT CGCCGTCTCC TTCTGGAAGA CACCCGTGGA GTGTGGCATC 1380
GGCTTCACCA TCATCCTCAG CGGGCTGCCC GTCTACTTCT TCGGGGTCTG GTGGAAAAAC 1440
AAGCCCAAGT GGCTCCTCCA GGGCATCTTC TCCACGACCG TCCTGTGTCA GAAGCTCATG 1500
CAGGTGGTCC CCCAGGAGAC ATAG



Seq ID NO: 183 Protein Sequence 
Protein Accession #: XP_035292.2 



1 11 21 31 41 51

| | | | | |

MAGAGPKRRA LAAPAAEEKE EAREKMLAAK SADGSAPAGE GEGVTLQRNI TLLNGVAIIV 60
GTIIGSGIFV TPTGVLKEAG SPGLALVVWA ACGVFSIVGA LCYAELGTTI SKSGGDYAYM 120
LEVYGSLPAF LKLWIELLII RPSSQYIVAL VFATYLLKPL FPTCPVPEEA AKLVACLCVL 180
LLTAVNCYSV KAATRVQDAF AAAKLLALAL IILLGFVQIG KGDVSNLDPN FSFEGTKLDV 240
GNIVLALYSG LFAYGGWNYL NFVTEEMINP YRNLPLAIII SLPIVTLVYV LTNLAYFTTL 300
STEQMLSSEA VAVDFGNYHL GVMSWIIPVF VGLSCFGSVN GSLFTSSRLF FVGSREGHLP 360
SILSMIHPQL LTPVPSLVFT CVMTLLYAFS KDIFSVINFF SFFNWLCVAL AIIGMIWLRH 420
RKPELERPIK VNLALPVFFI LACLFLIAVS FWKTPVECGI GFTIILSGLP VYFFGVWWKN 480
KPKWLLQGIF STTVLCQKLM QVVPQET



Seq ID NO: 184 DNA Sequence

Nucleic Acid Accession #: NM_005268.1

Coding sequence: 1..822

1 11 21 31 41 51

| | | | | |

ATGAACTGGA GTATCTTTGA GGGACTCCTG AGTGGGGTCA ACAAGTACTC CACAGCCTTT 60
GGGCGCATCT GGCTGTCTCT GGTCTTCATC TTCCGCGTGC TGGTGTACCT GGTGACGGCC 120
GAGCGTGTGT GGAGTGATGA CCACAAGGAC TTCGACTGCA ATACTCGCCA GCCCGGCTGC 180
TCCAACGTCT GCTTTGATGA GTTCTTCCCT GTGTCCCATG TGCGCCTCTG GGCCCTGCAG 240
CTTATCCTGG TGACATGCCC CTCACTGCTC GTGGTCATGC ACGTGGCCTA CCGGGAGGTT 300
CAGGAGAAGA GGCACCGAGA AGCCCATGGG GAGAACAGTG GGCGCCTCTA CCTGAACCCC 360
GGCAAGAAGC GGGGTGGGCT CTGGTGGACA TATGTCTGCA GCCTAGTGTT CAAGGCGAGC 420
GTGGACATCG CCTTTCTCTA TGTGTTCCAC TCATTCTACC CCAAATATAT CCTCCCTCCT 480
GTGGTCAAGT GCCACGCAGA TCCATGTCCC AATATAGTGG ACTGCTTCAT CTCCAAGCCC 540
TCAGAGAAGA ACATTTTCAC CCTCTTCATG GTGGCCACAG CTGCCATCTG CATCCTGCTC 600
AACCTCGTGG AGCTCATCTA CCTGGTGAGC AAGAGATGCC ACGAGTGCCT GGCAGCAAGG 660
AAAGCTCAAG CCATGTGCAC AGGTCATCAC CCCCACGGTA CCACCTCTTC CTGCAAACAA 720
GACGACCTCC TTTCGGGTGA CCTCATCTTT CTGGGCTCAG ACAGTCATCC TCCTCTCTTA 780
CCAGACCGCC CCCGAGACCA TGTGAAGAAA ACCATCTTGT GA



Seq ID NO: 185 Protein Sequence 
Protein Accession #: NP_005259.1

1 11 21 31 41 51

| | | | | |

MNWSIFEGLL SGVNKYSTAF GRIWLSLVFI FRVLVYLVTA ERVWSDDHKD FDCNTRQPGC 60
SNVCFDEFFP VSHVRLWALQ LILVTCPSLL VVMHVAYREV QEKRHREAHG ENSGRLYLNP 120
GKKRGGLWWT YVCSLVFKAS VDIAFLYVFH SFYPKYILPP VVKCHADPCP NIVDCFISKP 180
SEKNIFTLFM VATAAICILL NLVELIYLVS KRCHECLAAR KAQAMCTGHH PHGTTSSCKQ 240
DDLLSGDLIF LGSDSHPPLL PDRPRDHVKK TIL



Seq ID NO: 186 DNA Sequence 

Nucleic Acid Accession #: NM_002391.1

Coding sequence: 25..457

1 11 21 31 41 51

| | | | | |

CGGGCGAAGC AGCGCGGGCA GCGAGATGCA GCACCGAGGC TTCCTCCTCC TCACCCTCCT 60
CGCCCTGCTG GCGCTCACCT CCGCGGTCGC CAAAAAGAAA GATAAGGTGA AGAAGGGCGG 120
CCCGGGGAGC GAGTGCGCTG AGTGGGCCTG GGGGCCCTGC ACCCCCAGCA GCAAGGATTG 180
CGGCGTGGGT TTCCGCGAGG GCACCTGCGG GGCCCAGACC CAGCGCATCC GGTGCAGGGT 240
GCCCTGCAAC TGGAAGAAGG AGTTTGGAGC CGACTGCAAG TACAAGTTTG AGAACTGGGG 300
TGCGTGTGAT GGGGGCACAG GCACCAAAGT CCGCCAAGGC ACCCTGAAGA AGGCGCGCTA 360
CAATGCTCAG TGCCAGGAGA CCATCCGCGT CACCAAGCCC TGCACCCCCA AGACCAAAGC 420
AAAGGCCAAA GCCAAGAAAG GGAAGGGAAA GGACTAGACG CCAAGCCTGG ATGCCAAGGA 480
GCCCCTGGTG TCACATGGGG CCTGGCCACG CCCTCCCTCT CCCAGGCCCG AGATGTGACC 540
CACCAGTGCC TTCTGTCTGC TCGTTAGCTT TAATCAATCA TGCCCTGCCT TGTCCCTCTC 600

ACTCCCCAGC CCCACCCCTA AGTGCCCAAA GTGGGGAGGG ACAAGGGATT CTGGGAAGCT 660 

TGAGCCTCCC CCAAAGCAAT GTGAGTCCCA GAGCCCGCTT TTGTTCTTCC CCACAATTCC 720 

ATTACTAAGA AACACATCAA ATAAACTGAC TTTTTCCCCC CAATAAAAGC TCTTCTTTTT 780 
TAATAT

Seq ID NO: 187 Protein Sequence 
Protein Accession #: NP_002382.1

1 11 21 31 41 51

| | | | | |

MQHRGFLLLT LLALLALTSA VAKKKDKVKK GGPGSECAEW AWGPCTPSSK DCGVGFREGT 60
CGAQTQRIRC RVPCNWKKEF GADCKYKFEN WGACDGGTGT KVRQGTLKKA RYNAQCQETI 120
RVTKPCTPKT KAKAKAKKGK GKD

Seq ID NO: 188 DNA Sequence
Nucleic Acid Accession #: NM_015507
Coding sequence: 241..1902

1 11 21 31 41 51

| | | | | |

CCGCAGAGGA GCCTCGGCCA GGCTAGCCAG GGCGCCCCCA GCCCCTCCCC AGGCCGCGAG 60 

CGCCCCTGCC GCGGTGCCTG GCCTCCCCTC CCAGACTGCA GGGACAGCAC CCGGTAACTG 120 

CGAGTGGAGC GGAGGACCCG AGCGGCTGAG GAGAGAGGAG GCGGCGGCTT AGCTGCTACG 180 

GGGTCCGGCC GGCGCCCTCC CGAGGGGGGC TCAGGAGGAG GAAGGAGGAC CCGTGCGAGA 240 

ATGCCTCTGC CCTGGAGCCT TGCGCTCCCG CTGCTGCTCT CCTGGGTGGC AGGTGGTTTC 300 

GGGAACGCGG CCAGTGCAAG GCATCACGGG TTGTTAGCAT CGGCACGTCA GCCTGGGGTC 360 

TGTCACTATG GAACTAAACT GGCCTGCTGC TACGGCTGGA GAAGAAACAG CAAGGGAGTC 420 

TGTGAAGCTA CATGCGAACC TGGATGTAAG TTTGGTGAGT GCGTGGGACC AAACAAATGC 480 

AGATGCTTTC CAGGATACAC CGGGAAAACC TGCAGTCAAG ATGTGAATGA GTGTGGAATG 540 

AAACCCCGGC CATGCCAACA CAGATGTGTG AATACACACG GAAGCTACAA GTGCTTTTGC 600 

CTCAGTGGCC ACATGCTCAT GCCAGATGCT ACGTGTGTGA ACTCTAGGAC ATGTGCCATG 660 

ATAAACTGTC AGTACAGCTG TGAAGACACA GAAGAAGGGC CACAGTGCCT GTGTCCATCC 720 

TCAGGACTCC GCCTGGCCCC AAATGGAAGA GACTGTCTAG ATATTGATGA ATGTGCCTCT 780 

GGTAAAGTCA TCTGTCCCTA CAATCGAAGA TGTGTGAACA CATTTGGAAG CTACTACTGC 840 

AAATGTCACA TTGGTTTCGA ACTGCAATAT ATCAGTGGAC GATATGACTG TATAGATATA 900 

AATGAATGTA CTATGGATAG CCATACGTGC AGCCACCATG CCAATTGCTT CAATACCCAA 960 

GGGTCCTTCA AGTGTAAATG CAAGCAGGGA TATAAAGGCA ATGGACTTCG GTGTTCTGCT 1020 

ATCCCTGAAA ATTCTGTGAA GGAAGTCCTC AGAGCACCTG GTACCATCAA AGACAGAATC 1080 

AAGAAGTTGC TTGCTCACAA AAACAGCATG AAAAAGAAGG CAAAAATTAA AAATGTTACC 1140 

CCAGAACCCA CCAGGACTCC TACCCCTAAG GTGAACTTGC AGCCCTTCAA CTATGAAGAG 1200 

ATAGTTTCCA GAGGCGGGAA CTCTCATGGA GGTAAAAAAG GGAATGAAGA GAAAATGAAA 1260 

GAGGGGCTTG AGGATGAGAA AAGAGAAGAG AAAGCCCTGA AGAATGACAT AGAGGAGCGA 1320 

AGCCTGCGAG GAGATGTGTT TTTCCCTAAG GTGAATGAAG CAGGTGAATT CGGCCTGATT 1380 

CTGGTCCAAA GGAAAGCGCT AACTTCCAAA CTGGAACATA AAGATTTAAA TATCTCGGTT 1440 

GACTGCAGCT TCAATCATGG GATCTGTGAC TGGAAACAGG ATAGAGAAGA TGATTTTGAC 1500 

TGGAATCCTG CTGATCGAGA TAATGCTATT GGCTTCTATA TGGCAGTTCC GGCCTTGGCA 1560 

GGTCACAAGA AAGACATTGG CCGATTGAAA CTTCTCCTAC CTGACCTGCA ACCCCAAAGC 1620 

AACTTCTGTT TGCTCTTTGA TTACCGGCTG GCCGGAGACA AAGTCGGGAA ACTTCGAGTG 1680 

TTTGTGAAAA ACAGTAACAA TGCCCTGGCA TGGGAGAAGA CCACGAGTGA GGATGAAAAG 1740 

TGGAAGACAG GGAAAATTCA GTTGTATCAA GGAACTGATG CTACCAAAAG CATCATTTTT 1800 

GAAGCAGAAC GTGGCAAGGG CAAAACCGGC GAAATCGCAG TGGATGGCGT CTTGCTTGTT 1860

TCAGGCTTAT GTCCAGATAG CCTTTTATCT GTGGATGACT GAATGTTACT ATCTTTATAT 1920 

TTGACTTTGT ATGTCAGTTC CCTGGTTTTT TTGATATTGC ATCATAGGAC CTCTGGCATT 1980 

TTAGAATTAC TAGCTGAAAA ATTGTAATGT ACCAACAGAA ATATTATTGT AAGATGCCTT 2040 

TCTTGTATAA GATATGCCAA TATTTGCTTT AAATATCATA TCACTGTATC TTCTCAGTCA 2100 

TTTCTGAATC TTTCCACATT ATATTATAAA ATATGGAAAT GTCAGTTTAT CTCCCCTCCT 2160 

CAGTATATCT GATTTGTATA AGTAAGTTGA TGAGCTTCTC TCTACAACAT TTCTAGAAAA 2220 

TAGAAAAAAA AGCACAGAGA AATGTTTAAC TGTTTGACTC TTATGATACT TCTTGGAAAC 2280 

TATGACATCA AAGATAGACT TTTGCCTAAG TGGCTTAGCT GGGTCTTTCA TAGCCAAACT 2340 
TGTATATTTA AATTCTTTGT AATAATAATA TCCAAATCAT CAAAAAAAAA AAAAAAAA 

Seq ID NO: 189 Protein Sequence 
Protein Accession #: NP_056322.2

1 11 21 31 41 51

| | | | | |

MPLPWSLALP LLLSWVAGGF GNAASARHHG LLASARQPGV CHYGTKLACC YGWRRNSKGV 60
CEATCEPGCK FGECVGPNKC RCFPGYTGKT CSQDVNECGM KPRPCQHRCV NTHGSYKCFC 120
LSGHMLMPDA TCVNSRTCAM INCQYSCEDT EEGPQCLCPS SGLRLAPNGR DCLDIDECAS 180
GKVICPYNRR CVNTFGSYYC KCHIGFELQY ISGRYDCIDI NECTMDSHTC SHHANCFNTQ 240
GSFKCKCKQG YKGNGLRCSA IPENSVKEVL RAPGTIKDRI KKLLAHKNSM KKKAKIKNVT 300
PEPTRTPTPK VNLQPFNYEE IVSRGGNSHG GKKGNEEKMK EGLEDEKREE KALKNDIEER 360
SLRGDVFFPK VNEAGEFGLI LVQRKALTSK LEHKDLNISV DCSFNHGICD WKQDREDDFD 420
WNPADRDNAI GFYMAVPALA GHKKDIGRLK LLLPDLQPQS NFCLLFDYRL AGDKVGKLRV 480
FVKNSNNALA WEKTTSEDEK WKTGKIQLYQ GTDATKSIIF EAERGKGKTG EIAVDGVLLV 540
SGLCPDSLLS VDD

Seq ID NO: 190 DNA Sequence
Nucleic Acid Accession #: NM_006475
Coding sequence: 12..2522

1 11 21 31 41 51

| | | | | |

AGAGACTCAA GATGATTCCC TTTTTACCCA TGTTTTCTCT ACTATTGCTG CTTATTGTTA 60
ACCCTATAAA CGCCAACAAT CATTATGACA AGATCTTGGC TCATAGTCGT ATCAGGGGTC 120
GGGACCAAGG CCCAAATGTC TGTGCCCTTC AACAGATTTT GGGCACCAAA AAGAAATACT 180
TCAGCACTTG TAAGAACTGG TATAAAAAGT CCATCTGTGG ACAGAAAACG ACTGTTTTAT 240
ATGAATGTTG CCCTGGTTAT ATGAGAATGG AAGGAATGAA AGGCTGCCCA GCAGTTTTGC 300
CCATTGACCA TGTTTATGGC ACTCTGGGCA TCGTGGGAGC CACCACAACG CAGCGCTATT 360
CTGACGCCTC AAAACTGAGG GAGGAGATCG AGGGAAAGGG ATCCTTCACT TACTTTGCAC 420
CGAGTAATGA GGCTTGGGAC AACTTGGATT CTGATATCCG TAGAGGTTTG GAGAGCAACG 480
TGAATGTTGA ATTACTGAAT GCTTTACATA GTCACATGAT TAATAAGAGA ATGTTGACCA 540
AGGACTTAAA AAATGGCATG ATTATTCCTT CAATGTATAA CAATTTGGGG CTTTTCATTA 600
ACCATTATCC TAATGGGGTT GTCACTGTTA ATTGTGCTCG AATCATCCAT GGGAACCAGA 660
TTGCAACAAA TGGTGTTGTC CATGTCATTG ACCGTGTGCT TACACAAATT GGTACCTCAA 720
TTCAAGACTT CATTGAAGCA GAAGATGACC TTTCATCTTT TAGAGCAGCT GCCATCACAT 780
CGGACATATT GGAGGCCCTT GGAAGAGACG GTCACTTCAC ACTCTTTGCT CCCACCAATG 840
AGGCTTTTGA GAAACTTCCA CGAGGTGTCC TAGAAAGGTT CATGGGAGAC AAAGTGGCTT 900
CCGAAGCTCT TATGAAGTAC CACATCTTAA ATACTCTCCA GTGTTCTGAG TCTATTATGG 960
GAGGAGCAGT CTTTGAGACG CTGGAAGGAA ATACAATTGA GATAGGATGT GACGGTGACA 1020
GTATAACAGT AAATGGAATC AAAATGGTGA ACAAAAAGGA TATTGTGACA AATAATGGTG 1080
TGATCCATTT GATTGATCAG GTCCTAATTC CTGATTCTGC CAAACAAGTT ATTGAGCTGG 1140
CTGGAAAACA GCAAACCACC TTCACGGATC TTGTGGCCCA ATTAGGCTTG GCATCTGCTC 1200
TGAGGCCAGA TGGAGAATAC ACTTTGCTGG CACCTGTGAA TAATGCATTT TCTGATGATA 1260
CTCTCAGCAT GGTTCAGCGC CTCCTTAAAT TAATTCTGCA GAATCACATA TTGAAAGTAA 1320
AAGTTGGCCT TAATGAGCTT TACAACGGGC AAATACTGGA AACCATCGGA GGCAAACAGC 1380
TCAGAGTCTT CGTATATCGT ACAGCTGTCT GCATTGAAAA TTCATGCATG GAGAAAGGGA 1440
GTAAGCAAGG GAGAAACGGT GCGATTCACA TATTCCGCGA GATCATCAAG CCAGCAGAGA 1500
AATCCCTCCA TGAAAAGTTA AAACAAGATA AGCGCTTTAG CACCTTCCTC AGCCTACTTG 1560
AAGCTGCAGA CTTGAAAGAG CTCCTGACAC AACCTGGAGA CTGGACATTA TTTGTGCCAA 1620
CCAATGATGC TTTTAAGGGA ATGACTAGTG AAGAAAAAGA AATTCTGATA CGGGACAAAA 1680
ATGCTCTTCA AAACATCATT CTTTATCACC TGACACCAGG AGTTTTCATT GGAAAAGGAT 1740
TTGAACCTGG TGTTACTAAC ATTTTAAAGA CCACACAAGG AAGCAAAATC TTTCTGAAAG 1800
AAGTAAATGA TACACTTCTG GTGAATGAAT TGAAATCAAA AGAATCTGAC ATCATGACAA 1860
CAAATGGTGT AATTCATGTT GTAGATAAAC TCCTCTATCC AGCAGACACA CCTGTTGGAA 1920
ATGATCAACT GCTGGAAATA CTTAATAAAT TAATCAAATA CATCCAAATT AAGTTTGTTC 1980
GTGGTAGCAC CTTCAAAGAA ATCCCCGTGA CTGTCTATAC AACTAAAATT ATAACCAAAG 2040
TTGTGGAACC AAAAATTAAA GTGATTGAAG GCAGTCTTCA GCCTATTATC AAAACTGAAG 2100
GACCCACACT AACAAAAGTC AAAATTGAAG GTGAACCTGA ATTCAGACTG ATTAAAGAAG 2160
GTGAAACAAT AACTGAAGTG ATCCATGGAG AGCCAATTAT TAAAAAATAC ACCAAAATCA 2220
TTGATGGAGT GCCTGTGGAA ATAACTGAAA AAGAGACACG AGAAGAACGA ATCATTACAG 2280
GTCCTGAAAT AAAATACACT AGGATTTCTA CTGGAGGTGG AGAAACAGAA GAAACTCTGA 2340
AGAAATTGTT ACAAGAAGAG GTCACCAAGG TCACCAAATT CATTGAAGGT GGTGATGGTC 2400
ATTTATTTGA AGATGAAGAA ATTAAAAGAC TGCTTCAGGG AGACACACCC GTGAGGAAGT 2460
TGCAAGCCAA CAAAAAAGTT CAAGGTTCTA GAAGACGATT AAGGGAAGGT CGTTCTCAGT 2520
GAAAATCCAA AAACCAGAAA AAAATGTTTA TACAACCCTA AGTCAATAAC CTGACCTTAG 2580
AAAATTGTGA GAGCCAAGTT GACTTCAGGA ACTGAAACAT CAGCACAAAG AAGCAATCAT 2640
CAAATAATTC TGAACACAAA TTTAATATTT TTTTTTCTGA ATGAGAAACA TGAGGGAAAT 2700
TGTGGAGTTA GCCTCCTGTG GTAAAGGAAT TGAAGAAAAT ATAACACCTT ACACCCTTTT 2760
TCATCTTGAC ATTAAAAGTT CTGGCTAACT TTGGAATCCA TTAGAGAAAA ATCCTTGTCA 2820
CCAGATTCAT TACAATTCAA ATCGAAGAGT TGTGAACTGT TATCCCATTG AAAAGACCGA 2880
GCCTTGTATG TATGTTATGG ATACATAAAA TGCACGCAAG CCATTATCTC TCCATGGGAA 2940
GCTAAGTTAT AAAAATAGGT GCTTGGTGTA CAAAACTTTT TATATCAAAA GGCTTTGCAC 3000
ATTTCTATAT GAGTGGGTTT ACTGGTAAAT TATGTTATTT TTTACAACTA ATTTTGTACT 3060
CTCAGAATGT TTGTCATATG CTTCTTGCAA TGCATATTTT TTAATCTCAA ACGTTTCAAT 3120
AAAACCATTT TTCAGATATA AAGAGAATTA CTTCAAATTG AGTAATTCAG AAAAACTCAA 3180
GATTTAAGTT AAAAAGTGGT TTGGACTTGG GAA



Seq ID NO: 191 Protein Sequence 
Protein Accession #: BAA02836.1 



1 11 21 31 41 51

| | | | | |

MIPFLPMFSL LliLIVNPIN ANNHYDKILA HSRIRGRDQG PNVCALQQIL GTKKKYFSTC 60
KNWYKKSICG QKTTVLYECC PGYMRMEGMK GCPAVLPIDH VYGTU3IVGA TTTORYSDAS 120
KLREEIEGKG SFTYFAPSNE AWDNLDSDIR RGLESNVNVE LLNALHSHMI NKRMLTKDLK 180
NGMIIPSMYN NIX3LFINHYP NGWTVNCAR IIHGNQIATN GWHVIDRVL TQIGTSIODF 240
IEAEDDLSSF RAAAITSDIL EALGRDGHFT LFAPTNEAFE KLPRGVLERF MGDKVASEAL 300
MKYHIltNTLQ CSESIMGGAV FETLEGNTIE IGCDGDSITV NGIKMVNKKD IVTNNGVIHL 360
IDQVLIPDSA KQVIELAGKQ QTTFTDLVAQ LGLASALRPD GEYTMAPVN NAFSDDTLSM 420
VQRLLKLILQ NHILKVKVGL NELYNGQILE TIGGKQLRVF VYRTAVCIEN scme^skS 480
RNGAIHIFRE IIKPAEKSLH EKLKQDKRFS TFLSLLEAAD LKELLTQPGD WTLFVPTNDA 540
FKGMTSEEKE ILIRDKNALQ NIILYHLTPG VFIGKGFEPG VTNILKTTQG SKIFL^ND 600
TLLVNELKSK ESDIMTTNGV IHWDKLLYP ADTPVGNDQL LEILNKLIKY IQIKFVRGST 660
FKEIPVTVYT TKIITKWEP KIKVIEGSLQ PIIKTEGPTL TKVKIEGEPE PRLIKEGETI 720
TEVIHGEPII KKYTKIIDGV PVEITEKETR EERIITGPEI KYTRISTGGG ETEETLKKLL 780
QEEVTKVTKF IEGGDGHLFE DEEIKRLLQG DTPVRKLQAN KKVQGSRRRL REGRSQ

Seq ID NO: 192 DNA Sequence

Nucleic Acid Accession #: NM_006670
Coding sequence: 85..1347

1 11 21 31 41 51

| | | | | |

CCGGCTCGCG CCCTCCGGGC CCAGCCTCCC GAGCCTTCGG AGCGGGCGCC GTCCCAGCCC 60 

AGCTCCGGGG AAACGCGAGC CGCGATGCCT GGGGGGTGCT CCCGGGGCCC CGCCGCCGGG 120 

GACGGGCGTC TGCGGCTGGC GCGACTAGCG CTGGTACTCC TGGGCTGGGT CTCCTCGTCT 180 

TCTCCCACCT CCTCGGCATC CTCCTTCTCC TCCTCGGCGC CGTTCCTGGC TTCCGCCGTG 240 

TCCGCCCAGC CCCCGCTGCC GGACCAGTGC CCCGCGCTGT GCGAGTGCTC CGAGGCAGCG 300

CGCACAGTCA AGTGCGTTAA CCGCAATCTG ACCGAGGTGC CCACGGACCT GCCCGCCTAC 360

GTGCGCAACC TCTTCCTTAC CGGCAACCAG CTGGCCGTGC TCCCTGCCGG CGCCTTCGCC 420 

CGCCGGCCGC CGCTGGCGGA GCTGGCCGCG CTCAACCTCA GCGGCAGCCG CCTGGACGAG 480 

GTGCGCGCGG GCGCCTTCGA GCATCTGCCC AGCCTGCGCC AGCTCGACCT CAGCCACAAC 540 

CCACTGGCCG ACCTCAGTCC CTTCGCTTTC TCGGGCAGCA ATGCCAGCGT CTCGGCCCCC 600

AGTCCCCTTG TGGAACTGAT CCTGAACCAC ATCGTGCCCC CTGAAGATGA GCGGCAGAAC 660 

CGGAGCTTCG AGGGCATGGT GGTGGCGGCC CTGCTGGCGG GCCGTGCACT GCAGGGGCTC 720 

CGCCGCTTGG AGCTGGCCAG CAACCACTTC CTTTACCTGC CGCGGGATGT GCTGGCCCAA 780 

CTGCCCAGCC TCAGGCACCT GGACTTAAGT AATAATTCGC TGGTGAGCCT GACCTACGTG 840 

TCCTTCCGCA ACCTGACACA TCTAGAAAGC CTCCACCTGG AGGACAATGC CCTCAAGGTC 900

CTTCACAATG GCACCCTGGC TGAGTTGCAA GGTCTACCCC ACATTAGGGT TTTCCTGGAC 960 

AACAATCCCT GGGTCTGCGA CTGCCACATG GCAGACATGG TGACCTGGCT CAAGGAAACA 1020 

GAGGTAGTGC AGGGCAAAGA CCGGCTCACC TGTGCATATC CGGAAAAAAT GAGGAATCGG 1080 

GTCCTCTTGG AACTCAACAG TGCTGACCTG GACTGTGACC CGATTCTTCC CCCATCCCTG 1140 

CAAACCTCTT ATGTCTTCCT GGGTATTGTT TTAGCCCTGA TAGGCGCTAT TTTCCTCCTG 1200

GTTTTGTATT TGAACCGCAA GGGGATAAAA AAGTGGATGC ATAACATCAG AGATGCCTGC 1260 

AGGGATCACA TGGAAGGGTA TCATTACAGA TATGAAATCA ATGCGGACCC CAGATTAACA 1320 

AACCTCAGTT CTAACTCGGA TGTCTGAGAA ATATTAGAGG ACAGACCAAG GACAACTCTG 1380 

CATGAGATGT AGACTTAAGC TTTATCCCTA CTAGGCTTGC TCCACTTTCA TCCTCCACTA 1440 

TAGATACAAC GGACTTTGAC TAAAAGCAGT GAAGGGGATT TGCTTCCTTG TTATGTAAAG 1500

TTTCTCGGTG TGTTCTGTTA ATGTAAGACG ATGAACAGTT GTGTATAGTG TTTTACCCTC 1560 

TTCTTTTTCT TGGAACTCCT CAACACGTAT GGAGGGATTT TTCAGGTTTC AGCATGAACA 1620 

TGGGCTTCTT GCTGTCTGTC TCTCTCTCAG TACAGTTCAA GGTGTAGCAA GTGTACCCAC 1680 

ACAGATAGCA TTCAACAAAA GCTGCCTCAA CTTTTTCGAG AAAAATACTT TATTCATAAA 1740 

TATCAGTTTT ATTCTCATGT ACCTAAGTTG TGGAGAAAAT AATTGCATCC TATAAACTGC 1800

CTGCAGACGT TAGCAGGCTC TTCAAAATAA CTCCATGGTG CACAGGAGCA CCTGCATCCA 1860 

AGAGCATGCT TACATTTTAC TGTTCTGCAT ATTACAAAAA ATAACTTGCA ACTTCATAAC 1920 

TTCTTTGACA AAGTAAATTA CTTTTTTGAT TGCAGTTTAT ATGAAAATGT ACTGATTTTT 1980 

TTTTAATAAA CTGCATCGAG ATCCAACCGA CTGAATTGTT AAAAAAAAAA AAAAATAAAG 2040 

ATTCTTAAAA GAA

Seq ID NO: 193 Protein Sequence 
Protein Accession #: CAA82324.1 

1 11 21 31 41 51

| | | | | |

MPGGCSRGPA AGDGRLRLAR LALVLLGWVS SSSPTSSASS FSSSAPFLAS AVSAQPPLPD 60 

QCPALCECSE AARTVKCVNR NLTEVPTDLP AYVRNLFLTG NQLAVLPAGA FARRPPLAEL 120

AALNLSGSRL DEVRAGAFEH LPSLRQLDLS HNPLADLSPF AFSGSNASVS APSPLVELIL 180 

NHIVPPEDER QNRSFEGMW AALLAGRALQ GLRRLELASN HFLYLPRDVL AQLPSLRHLD 240

LSNNSLVSLT YVSFRNLTHL ESLHLEDNAL KVLHNGTLAE LQGLPHIRVF LDNNPWVCDC 300 

HMADMVTWLK ETEWQGKDR LTCAYPEKMR NRVLLELNSA DLDCDPILPP SLQTSYVFLG 360 
IVLALIGAIF LLVLYLNRKG IKKWMHNIRD ACRDHMEGYH YRYEINADPR LTNLSSNSDV 

Seq ID NO: 194 DNA Sequence

Nucleic Acid Accession #: NM_014400
Coding sequence: 86..1126

1 11 21 31 41 51

| | | | | |

GGTTACTCAT CCTGGGCTCA GGTAAGAGGG CCCGAGCTCG GAGGCGGCAC ACCCAGGGGG 60 

GACGCCAAGG GAGCAGGACG GAGCCATGGA CCCCGCCAGG AAAGCAGGTG CCCAGGCCAT 120 

GATCTGGACT GCAGGCTGGC TGCTGCTGCT GCTGCTTCGC GGAGGAGCGC AGGCCCTGGA 180 

GTGCTACAGC TGCGTGCAGA AAGCAGATGA CGGATGCTCC CCGAACAAGA TGAAGACAGT 240

GAAGTGCGCG CCGGGCGTGG ACGTCTGCAC CGAGGCCGTG GGGGCGGTGG AGACCATCCA 300 

CGGACAATTC TCGCTGGCAG TGCSGGGTTG CGGTTCGGGA CTCCCCGGCA AGAATGACCG 360 

CGGCCTGGAT CTTCACGGGC TTCTGGCGTT CATCCAGCTG CAGCAATGCG CTCAGGATCG 420 

CTGCAACGCC AAGCTCAACC TCACCTCGCG GGCGCTCGAC CCGGCAGGTA ATGAGAGTGC 480 

ATACCCGCCC AACGGCGTGG AGTGCTACAG CTGTGTGGGC CTGAGCCGGG AGGCGTGCCA 540

GGGTACATCG CCGCCGGTCG TGAGCTGCTA CAACGCCAGC GATCATGTCT ACAAGGGCTG 600 

CTTCGACGGC AACGTCACCT TGACGGCAGC TAATGTGACT GTGTCCTTGC CTGTCCGGGG 660 

CTGTGTCCAG GATGAATTCT GCACTCGGGA TGGAGTAACA GGCCCAGGGT TCACGCTCAG 720 

TGGCTCCTGT TGCCAGGGGT CCCGCTGTAA CTCTGACCTC CGCAACAAGA CCTACTTCTC 780

CCCTCGAATC CCACCCCTTG TCCGGCTGCC CCCTCCAGAG CCCACGACTG TGGCCTCAAC 840

CACATCTGTC ACCACTTCTA CCTCGCCCCC AGTGAGACCC ACATCCACCA CCAAACCCAT 900

GCCAGCGCCA ACCAGTCAGA CTCCGAGACA GGGAGTAGAA CACGAGGCCT CCCGGGATGA 960

GGAGCCCAGG TTGACTGGAG GCGCCGCTGG CCACCAGGAC CGCAGCAATT CAGGGCAGTA 1020

TCCTGCAAAA GGGGGGCCCC AGCAGCCCCA TAATAAAGGC TGTGTGGCTC CCACAGCTGG 1080

ATTGGCAGCC CTTCTGTTGG CCGTGGCTGC TGGTGTCCTA CTGTGAGCTT CTCCACCTGG 1140

AAATTTCCCT CTCACCTACT TCTCTGGCCC TGGGTACCCC TCTTCTCATC ACTTCCTGTT 1200

CCCACCACTG GACTGGGCTG GCCCAGCCCC TGTTTTTCCA ACATTCCCCA GTATCCCCAG 1260

CTTCTGCTGC GCTGGTTTGC GGCTTTGGGA AATAAAATAC CGTTG1ATAT ATTCTGGCAG 1320

GGGTGTTCTA GCTTTTTGAG GACAGCTCCT GTATCCTTCT CATCCTTGTC TCTCCGCTTG 1380

TCCTCTTGTG ATGTTAGGAC AGAGTGAGAG AAGTCAGCTG TCACGGGGAA GGTGAGAGAG 1440

AGGATGCTAA GCTTCCTACT CACTTTCTCC TAGCCAGCCT GGACTTTGGA GCGTGGGGTG 1500

GGTGGGACAA TGGCTCCCCA CTCTAAGCAC TGCCTCCCCT ACTCCCCGCA TCTTTGGGGA 1560

ATCGGTTCCC CATATGTCTT CCTTACTAGA CTGTGAGCTC CTCGAGGGCA GGGACCGTGC 1620

CTTATGTCTG TGTGTGATCA GTTTCTGGCA CATAAATGCC TCAATAAAGA TTTAATTACT 1680

TTGTATAGTG AAAAAAAA



Seq ID NO: 195 Protein Sequence
Protein Accession #: NP_055215

1 11 21 31 41 51

| | | | | |

MDPARKAGAQ AMIWTAGWLL LLLLRGGAQA LECYSCVQKA DDGCSPNKMK TVKCAPGVDV 60
CTEAVGAVET IHGQFSLAVX GCGSGLPGKN DRGLDLHGLL AFIQLQQCAQ DRCNAKLNLT 120
SRALDPAGNE SAYPPNGVEC YSCVGLSREA CQGTSPPWS CYNASDHVYK GCPDGNVTLT 180
AANVTVSLPV RGCVQDEFCT RDGVTGPGFT LSGSCCQGSR CNSDLRNKTY FSPRIPPLVR 240
LPPPEPTTVA STTSVTTSTS APVRPTSTTK PMPAPTSQTP RQGVEHEASR DEEPRLTGGA 300
AGHQDRSNSG QYPAKGGPQQ PHNKGCVAPT AGLAALLLAV AAGVLL



Seq ID NO: 196 DNA Sequence
Nucleic Acid Accession #: NM_006536
Coding sequence: 109..2940

1 11 21 31 41 51

| | | | | |

ACCTAAAACC TTGCAAGTTC AGGAAGAAAC CATCTGCATC CATATTGAAA ACCTGACACA 60

ATGTATGCAG CAGGCTCAGT GTGAGTGAAC TGGAGGCTTC TCTACAACAT GACCCAAAGG 120

AGCATTGCAG GTCCTATTTG CAACCTGAAG TTTGTGACTC TCCTGGTTGC CTTAAGTTCA 180

GAACTCCCAT TCCTGGGAGC TGGAGTACAG CTTCAAGACA ATGGGTATAA TGGATTGCTC 240

ATTGCAATTA ATCCTCAGGT ACCTGAGAAT CAGAACCTCA TCTCAAACAT TAAGGAAATG 300

ATAACTGAAG CTTCATTTTA CCTATTTAAT GCTACCAAGA GAAGAGTATT TTTCAGAAAT 360

ATAAAGATTT TAATACCTGC CACATGGAAA GCTAATAATA ACAGCAAAAT AAAACAAGAA 420

TCATATGAAA AGGCAAATGT CATAGTGACT GACTGGTATG GGGCACATGG AGATGATCCA 480

TACACCCTAC AATACAGAGG GTGTGGAAAA GAGGGAAAAT ACATTCATTT CACACCTAAT 540

TTCCTACTGA ATGATAACTT AACAGCTGGC TACGGATCAC GAGGCCGAGT GTTTGTCCAT 600

GAATGGGCCC ACCTCCGTTG GGGTGTGTTC GATGAGTATA ACAATGACAA ACCTTTCTAC 660

ATAAATGGGC AAAATCAAAT TAAAGTGACA AGGTGTTCAT CTGACATCAC AGGCATTTTT 720

GTGTGTGAAA AAGGTCCTTG CCCCCAAGAA AACTGTATTA TTAGTAAGCT TTTTAAAGAA 780

GGATGCACCT TTATCTACAA TAGCACCCAA AATGCAACTG CATCAATAAT GTTCATGCAA 840

AGTTTATCTT CTGTGGTTGA ATTTTGTAAT GCAAGTACCC ACAACCAAGA AGCACCAAAC 900

CTACAGAACC AGATGTGCAG CCTCAGAAGT GCATGGGATG TAATCACAGA CTCTGCTGAC 960

TTTCACCACA GCTTTCCCAT GAATGGGACT GAGCTTCCAC CTCCTCCCAC ATTCTCGCTT 1020

GTACAGGCTG GTGACAAAGT GGTCTGTTTA GTGCTGGATG TGTCCAGCAA GATGGCAGAG 1080

GCTGACAGAC TCCTTCAACT ACAACAAGCC GCAGAATTTT ATTTGATGCA GATTGTTGAA 1140

ATTCATACCT TCGTGGGCAT TGCCAGTTTC GACAGCAAAG GAGAGATCAG AGCCCAGCTA 1200

CACCAAATTA ACAGCAATGA TGATCGAAAG TTGCTGGTTT CATATCTGCC CACCACTGTA 1260

TCAGCTAAAA CAGACATCAG CATTTGTTCA GGGCTTAAGA AAGGATTTGA GGTGGTTGAA 1320

AAACTGAATG GAAAAGCTTA TGGCTCTGTG ATGATATTAG TGACCAGCGG AGATGATAAG 1380

CTTCTTGGCA ATTGCTTACC CACTGTGCTC AGCAGTGGTT CAACAATTCA CTCCATTGCC 1440

CTGGGTTCAT CTGCAGCCCC AAATCTGGAG GAATTATCAC GTCTTACAGG AGGTTTAAAG 1500

TTCTTTGTTC CAGATATATC AAACTCCAAT AGCATGATTG ATGCTTTCAG TAGAATTTCC 1560

TCTGGAACTG GAGACATTTT CCAGCAACAT ATTCAGCTTG AAAGTACAGG TGAAAATGTC 1620

AAACCTCACC ATCAATTGAA AAACACAGTG ACTGTGGATA ATACTGTGGG CAACGACACT 1680

ATGTTTCTAG TTACGTGGCA GGCCAGTGGT CCTCCTGAGA TTATATTATT TGATCCTGAT 1740

GGACGAAAAT ACTACACAAA TAATTTTATC ACCAATCTAA CTTTTCGGAC AGCTAGTCTT 1800

TGGATTCCAG GAACAGCTAA GCCTGGGCAC TGGACTTACA CCCTGAACAA TACCCATCAT 1860

TCTCTGCAAG CCCTGAAAGT GACAGTGACC TCTCGCGCCT CCAACTCAGC TGTGCCCCCA 1920

GCCACTGTGG AAGCCTTTGT GGAAAGAGAC AGCCTCCATT TTCCTCATCC TGTGATGATT 1980

TATGCCAATG TGAAACAGGG ATTTTATCCC ATTCTTAATG CCACTGTCAC TGCCACAGTT 2040

GAGCCAGAGA CTGGAGATCC TGTTACGCTG AGACTCCTTG ATGATGGAGC AGGTGCTGAT 2100

GTTATAAAAA ATGATGGAAT TTACTCGAGG TATTTTTTCT CCTTTGCTGC AAATGGTAGA 2160

TATAGCTTGA AAGTGCATGT CAATCACTCT CCCAGCATAA GCACCCCAGC CCACTCTATT 2220

CCAGGGAGTC ATGCTATGTA TGTACCAGGT TACACAGCAA ACGGTAATAT TCAGATGAAT 2280

GCTCCAAGGA AATCAGTAGG CAGAAATGAG GAGGAGCGAA AGTGGGGCTT TAGCCGAGTC 2340

AGCTCAGGAG GCTCCTTTTC AGTGCTGGGA GTTCCAGCTG GCCCCCACCC TGATGTGTTT 2400

CCACCATGCA AAATTATTGA CCTGGAAGCT GTAAAAGTAG AAGAGGAATT GACCCTATCT 2460

TGGACAGCAC CTGGAGAAGA CTTTGATCAG GGCCAGGCTA CAAGCTATGA AATAAGAATG 2520

AGTAAAAGTC TACAGAATAT CCAAGATGAC TTTAACAATG CTATTTTAGT AAATACATCA 2580

AAGCGAAATC CTCAGCAAGC TGGCATCAGG GAGATATTTA CGTTCTCACC CCAGATTTCC 2640 

ACGAATGGAC CTGAACATCA GCCAAATGGA GAAACACATG AAAGCCACAG AATTTATGTT 2700 

GCAATACGAG CAATGGATAG GAACTCCTTA CAGTCTGCTG TATCTAACAT TGCCCAGGCG 2760 

CCTCTGTTTA TTCCCCCCAA TTCTGATCCT GTACCTGCCA GAGATTATCT TATATTGAAA 2820 

GGAGTTTTAA CAGCAATGGG TTTGATAGGA ATCATTTGCC TTATTATAGT TGTGACACAT 2880 

CATACTTTAA GCAGGAAAAA GAGAGCAGAC AAGAAAGAGA ATGGAACAAA ATTATTATAA 2940 

ATAAATATCC AAAGTGTCTT CCTTCTTAGA TATAAGACCC ATGGCCTTCG ACTACAAAAA 3000 

CATACTAACA AAGTCAAATT AACATCAAAA CTGTATTAAA ATGCATTGAG TTTTTGTACA 3060 

ATACAGATAA GATTTTTACA TGGTAGATCA ACAATTCTTT TTGGGGGTAG ATTAGAAAAC 3120 

CCTTACACTT TGGCTATGAA CAAATAATAA AAATTATTCT TTAAAGTAAT GTCTTTAAAG 3180 

GCAAAGGGAA GGGTAAAGTC GGACCAGTGT CAAGGAAAGT TTGTTTTATT GAGGTGGAAA 3240 

AATAGCCCCA AGCAGAGAAA AGGAGGGTAG GTCTGCATTA TAACTGTCTG TGTGAAGCAA 3300 

TCATTTAGTT ACTTTGATTA ATTTTTCTTT TCTCCTTATC TGTGCAGTAC AGGTTGCTTG 3360 

TTTACATGAA GATCATGCTA TATTTTATAT ATGTAGCCCC TAATGCAAAG CTCTTTACCT 3420 

CTTGCTATTT TGTTATATAT ATTTCAGATG ACATCTCCCT GCTAATGCTC AGAGATCTTT 3480 

TTTCACTGTA AGAGGTAACC TTTAACAATA TGGGTATTAC CTTTGTCTCT TCATACCGGT 3540 

TTTATGACAA AGGTCTATTG AATTTATTTG TNTGTAAGTT TCTACTCCCA TCAAAGCAGC 3600 

TTTCTAAGTT TATTGCCTTG GGTTATTATG GAATGATAGT TATAGCCCCN TATAATGCCT 3660 
TACCTAGGAA A 

Seq ID NO: 197 Protein Sequence 
Protein Accession #: NP_006527 

1 11 21 31 41 51 

I I I I I I 

MTQRSIAGPI CNLKFVTLLV ALSSELPFLG AGVQLQDNGY NGLLIAINPQ VPENQNLISN 60 

IKEMITEASF YLFNATKRRV FFRNIKILIP ATWKANNNSK IKQESYEKAN VIVTDWYGAH 120 

GDDPYTLQYR GCGKEGKYIH FTPNFLLNDN LTAGYGSRGR VFVHEWAHLR WGVFDEYNND 180 

KPFYINGQNQ IKVTRCSSDI TGIFVCEKGP CPQENCIISK LFKEGCTFIY NSTQNATASI 240 

MFMQSLSSW EFCNASTHNQ EAPNLQNQMC SLRSAWDVIT DSADFHHSFP MNGTELPPPP 300 

TFSLVOAGDK WCLVLDVSS KMAEADRLLQ LQQAAEFYLM QIVEIHTFVG IASFDSKGEI 360 

RAQLHQINSN DDRKLLVSYL PTTVSAKTDI SICSGLKKGF EWEKLNGKA YGSVMILVTS 420 

GDDKLLGNCL PTVLSSGSTI HSIALGSSAA PNLEELSRLT GGLKFFVPDI SNSNSMIDAF 480 

SRISSGTGDI FQQHIQLEST GENVKPHHQL KNTVTVDNTV GNDTMFLVTW QASGPPEIIL 540 

FDPDGRKYYT NNFITNLTFR TASLWIPGTA KPGHWTYTLN NTHHSLQALK VTVTSRASNS 600 

AVPPATVEAF VERDSLHFPH PVMIYANVKQ GFYPILNATV TATVEPETGD PVTLRLLDDG 660 

AGADVIKNDG IYSRYFFSFA ANGRYSLKVH VNHSPSISTP AHSIPGSHAM YVPGYTANGN 720

IQMNAPRKSV GRNEEERKWG FSRVSSGGSF SVLGVPAGPH PDVFPPCKII DLEAVKVEEE 780 

LTLSWTAPGE DFDQGQATSY EIRMSKSLQN IQDDFNNAIL VNTSKRNPQQ AGIREIFTFS 840 

PQISTNGPEH QPNGETHESH RIYVAIRAMD RNSLQSAVSN IAQAPLFIPP NSDPVPARDY 900 
LILKGVLTAM GLIGIICLII WTHHTLSRK KRADKKENGT KLL 



Seq ID NO: 198 DNA Sequence 
Nucleic Acid Accession #: NM_001944 
Coding sequence: 84..3083

1 11 21 31 41 51 

I I I I I I 

TTTTCTTAGA CATTAACTGC AGACGGCTGG CAGGATAGAA GCAGCGGCTC ACTrGGACTT 60 

TTTCACCAGG GAAATCAGAG ACAATGATGG GGCTCTTCCC CAGAACTACA GGGGCTCTGG 120 

CCATCTTCGT GGTGGTCATA TTGGTTCATG GAGAATTGCG AATAGAGACT AAAGGTCAAT 180 

ATGATGAAGA AGAGATGACT ATGCAACAAG CTAAAAGAAG GCAAAAACGT GAATGGGTGA 240 

AATTTGCCAA ACCCTGCAGA GAAGGAGAAG ATAACTCAAA AAGAAACCCA ATTGCCAAGA 300 

TTACTTCAGA TTACCAAGCA ACCCAGAAAA TCACCTACCG AATCTCTGGA GTGGGAATCG 360 

ATCAGCCGCC TTTTGGAATC TTTGTTGTTG ACAAAAACAC TGGAGATATT AACATAACAG 420 

CTATAGTCGA CCGGGAGGAA ACTCCAAGCT TCCTGATCAC ATGTCGGGCT CTAAATGCCC 480 

AAGGACTAGA TGTAGAGAAA CCACTTATAC TAACGGTTAA AATTTTGGAT ATTAATGATA 540 

ATCCTCCAGT ATTTTCACAA CAAATTTTCA TGGGTGAAAT TGAAGAAAAT AGTGCCTCAA 600 

ACTCACTGGT GATGATACTA AATGCCACAG ATGCAGATGA ACCAAACCAC TTGAATTCTA 660 

AAATTGCCTT CAAAATTGTC TCTCAGGAAC CAGCAGGCAC ACCCATGTTC CTCCTAAGCA 720 

GAAACACTGG GGAAGTCCGT ACTTTGACCA ATTCTCTTGA CCGAGAGCAA GCTAGCAGCT 780 

ATCGTCTGGT TGTGAGTGGT GCAGACAAAG ATGGAGAAGG ACTATCAACT CAATGTGAAT 840 

GTAATATTAA AGTGAAAGAT GTCAACGATA ACTTCCCAAT GTTTAGAGAC TCTCAGTATT 900 

CAGCACGTAT TGAAGAAAAT ATTTTAAGTT CTGAATTACT TCGATTTCAA GTAACAGATT 960 

TGGATGAAGA GTACACAGAT AATTGGCTTG CAGTATATTT CTTTACCTCT GGGAATGAAG 1020 

GAAATTGGTT TGAAATACAA ACTGATCCTA GAACTAATGA AGGCATCCTG AAAGTGGTGA 1080 

AGGCTCTAGA TTATGAACAA CTACAAAGCG TGAAACTTAG TATTGCTGTC AAAAACAAAG 1140 

CTGAATTTCA CCAATCAGTT ATCTCTCGAT ACCGAGTTCA GTCAACCCCA GTCACAATTC 1200 

AGGTAATAAA TGTAAGAGAA GGAATTGCAT TCCGTCCTGC TTCCAAGACA TTTACTGTGC 1260 

AAAAAGGCAT AAGTAGCAAA AAATTGGTGG ATTATATCCT GGGAACATAT CAAGCCATCG 1320 

ATGAGGACAC TAACAAAGCT GCCTCAAATG TCAAATATGT CATGGGACGT AACGATGGTG 1380 

GATACCTAAT GATTGATTCA AAAACTGCTG AAATCAAATT TGTCAAAAAT ATGAACCGAG 1440 

ATTCTACTTT CATAGTTAAC AAAACAATCA CAGCTGAGGT TCTGGCCATA GATGAATACA 1500 

CGGGTAAAAC TTCTACAGGC ACGGTATATG TTAGAGTACC CGATTTCAAT GACAATTGTC 1560 

CAACAGCTGT CCTCGAAAAA GATGCAGTTT GCAGTTCTTC ACCTTCCGTG GTTGTCTCCG 1620 

CTAGAACACT GAATAATAGA TACACTGGCC CCTATACATT TGCACTGGAA GATCAACCTG 1680 






TAAAGTTGCC TGCCGTATGG AGTATCACAA CCCTCAATGC TACCTCGGCC CTCCTCAGAG 1740

CCCAGGAACA GATACCTCCT GGAGTATACC ACATCTCCCT GGTACTTACA GACAGTCAGA 1800 

ACAATCGGTG TGAGATGCCA CGCAGCTTGA CACTGGAAGT CTGTCAGTGT GACAACAGGG 1860 

AACTTCTTAC CCAACCACAA GCCCTGGGAC CAGGTATGGC AGGCCGCACT 1920 

SS™^ GGGGCCTGCC GCCATCGGCC TGCTGCTCCT TGGTCTCCTG CTGCTGCTGT 1980 

^^!?' TCTGCTGTTG ACCTGTGACT GTGGGGCAGG TTCTACTGGG GGAGTGACAG 2040 

GTCGTTTTAT CCCAGTTCCT GATGGCTCAG AAGGAACAAT TCATCAGTGG GGAATTGAAG 2100 

TGAAGACAAG GAAATCACAA ATATTTGTGT GCCTCCTGTA ACAGCCAATG 2160 

GAGCCGATTT CATGGAAAGT TCTGAAGTTT GTACAAATAC GTATGCCAGA GGCACAGCGG 2220 

TGGAAGGCAC TTCAGGAATG GAAATGACCA CTAAGCTTGG AGCAGCCACT GAATCTGGAG 2280 

GTGCTGCAGG CTTTGCAACA GGGACAGTGT CAGGAGCTGC TTCAGGATTC GGAGCAGCCA 2340 

STSf^ CATCTGTTCC TCAGGGCAGT CTGGAACCAT GAGAACAAGG CATTCCACTG 2400 

GAGGAACCAA TAAGGACTAC GCTGATGGGG CGATAAGCAT GAATTTTCTG GACTCCTACT 2460

TTTOCAGAA AGCATTTGCC TGTGCGGAGG AAGACGATGG CCAGGAAGCA AATGACTGCT 2520 

TGTTGATCTA TGATAATGAA GGCGCAGATG CCACTGGTTC TCCTGTGGGC TCCGTGGGTT 2580 

GTTGCAGTTT TATTGCTGAT GACCTGGATG ACAGCTTCTT GGACTCACTT GGACCCAAAT 2640 

TTAAAAAACT TGCAGAGATA AGCCTTGGTG TTGATGGTGA AGGCAAAGAA GTTCAGCCAC 2700 

CCTCTAAAGA CAGCGGTTAT GGGATTGAAT CCTGTGGCCA TCCCATAGAA GTCCAGCAGA 2760 

CAGGATTTGT TAAGTGCCAG ACTTTGTCAG GAAGTCAAGG AGCTTCTGCT TTGTCCGCCT 2820 

CTGGGTCTGT CCAGCCAGCT GTTTCCATCC CTGACCCTCT GCAGCATGGT AACTATTTAG 2880 

TAACGGAGAC TTACTCGGCT TCTGGTTCCC TCGTGCAACC TTCCACTGCA GGCTTTGATC 2940 

CACTTCTCAC ACAAAATGTG ATAGTGACAG AAAGGGTGAT CTGTCCCATT TCCAGTGTTC 3000 

CTGGCAACCT AGCTGGCCCA ACGCAGCTAC GAGGGTCACA TACTATGCTC TGTACAGAGG 3060 

ATCCTTGCTC CCGTCTAATA TGACCAGAAT GAGCTGGAAT ACCACACTGA CCAAATCTGG 3120 

ATCTTTGGAC TAAAGTATTC AAAATAGCAT AGCAAAGCTC ACTGTATTGG GCTAATAATT 3180 

TGGCACTTAT TAGCTTCTCT CATAAACTGA TCACGATTAT AAATTAAATG TTTGGGTTCA 3240 

TACCCCAAAA GCAATATGTT GTCACTCCTA ATTCTCAAGT ACTATTCAAA TTGTAGTAAA 3300 
TCTTAAAGTT TTTCAAAACC CTAAAATCAT ATTCGC 

Seq ID NO: 199 Protein Sequence 
Protein Accession #: NP_001935 



1 11 21 31 41 51

| | | | | |

MMGLFPRTTG ALAIFVWIL VHGELRIETK GQYDEEEMTM QQAKRRQKRE WVKFAKPCRE 60 

GEDNSKRNPI AKITSDYQAT QKITYRISGV GIDQPPFGIF WDKNTGDIN ITAIVDREET 120 

PSFLITCRAL NAQGLDVEKP LILTVKILDI NDNPPVFSQQ IFMGEIEENS ASNSLVMILN 180 

ATDADEPNHL NSKIAFKIVS QEPAGTPMFL LSRNTGEVRT LTNSLDREQA SSYRLWSGA 240 

DKDGEGLSTQ CECNIKVKDV NDNFPMFRDS QYSARIEENI LSSELLRFQV TDLDEEYTDN 300 

WLAVYFFTSG NEGNWFEIQT DPRTNEGILK WKALDYEQL QSVKLSIAVK NKAEFHQSVI 360 

SRYRVQSTPV TIQVINVREG IAFRPASKTF TVQKGISSKK LVDYILGTYQ AIDEDTNKAA 420 

SNVKYVMGRN DGGYLMIDSK TAEIKFVKNM NRDSTFIVNK TITAEVLAID EYTGKTSTGT 480 

VYVRVPDFND NCPTAVLEKD AVCSSSPSW VSARTLNNRY TGPYTFALED OPVKLPAVWS 540 

ITTLNATSAL LRAQEQIPPG VYHISLVLTD SQNNRCEMPR SLTLEVCQCD NRGICGTSYP 600 
TTSPGTRYGR PHSGRLGPAA IGLLLLGLLL LLLAPLLLLT CDCGAGSTGG VTGGFIPVPD 660
GSEGTIHQWG IEGAHPEDKE ITNICVPPVT ANGADFMESS EVCTNTYARG TAVEGTSGME 720
MTTKLGAATE SGGAAGFATG TVSGAASGFG AATGVGICSS GQSGTMRTRH STGGTNKDYA 780

DGAISMNFLD SYFSQKAFAC AEEDDGQEAN DCLLIYDNEG ADATGSPVGS VGCCSFIADD 840

LDDSFLDSLG PKFKKLAEIS LGVDGEGKEV QPPSKDSGYG IESCGHPIEV QQTGFVKCQT 900

LSGSQGASAL SASGSVQPAV SIPDPLQHGN YLVTETYSAS GSLVQPSTAG FDPLLTQNVI 960
VTERVICPIS SVPGNLAGPT QLRGSHTMLC TEDPCSRLI



Seq ID NO: 200 DNA Sequence 
Nucleic Acid Accession #: NM_020411 
Coding sequence: 86..526

1 11 21 31 41 51

| | | | | |

GGACCTGGGA AGGAGCATAG GACAGGGCAA GGCGGGATAA GGAGGGGCAC CACAGCCCTT 60 

AAGGCACGAG GGAACCTCAC TGCGCATGCT CCTTTGGTGC CCACCTCAGT GCGCATGTTC 120 

ACTGGGCGTC TTCCCATCGG CCCCTTCGCC AGTGTGGGGA ACGCGGCGGA GCTGTGAGCC 180 

GGCGACTCGG GTCCCTGAGG TCTGGATTCT TTCTCCGCTA CTGAGACACG GCGGACACAC 240 

ACAAACACAG AACCACACAG CCAGTCCCAG GAGCCCAGTA ATGGAGAGCC CCAAAAAGAA 300 

GAACCAGCAG CTGAAAGTCG GGATCCTACA CCTGGGCAGC AGACAGAAGA AGATCAGGAT 360 

ACAGCTGAGA TCCCAGTGCG CGACATGGAA GGTGATCTGC AAGAGCTGCA TCAGTCAAAC 420 

ACCGGGGATA AATCTGGATT TGGGTTCCGG CGTCAAGGTG AAGATAATAC CTAAAGAGGA 480 

ACACTGTAAA ATGCCAGAAG CAGGTGAAGA GCAACCACAA GTTTAAATGA AGACAAGCTG 540 

AAACAACGCA AGCTGGTTTT ATATTAGATA TTTGACTTAA ACTATCTCAA TAAAGTTTTG 600 
CAGCTTTCAC CAAAAAAAAA AAAAAA 

Seq ID NO: 201 Protein Sequence 
Protein Accession #: NP_065144.1 

1 11 21 31 41 51

| | | | | |

MLLWCPPQCA CSLGVFPSAP SPVWGTRRSC EPATRVPEVW ILSPLLRHGG HTQTQNHTAS 60 
PRSPVMESPK KKNQQLKVGI LHLGSRQKKI RIQLRSQCAT WKVICKSCIS QTPGINLDLG 120 
SGVKVKIIPK EEHCKMPEAG EEQPQV

Seq ID NO: 202 DNA Sequence
Nucleic Acid Accession #: AA172056 
Coding sequence: 121.. 339 



1 11 21 31 41 51 

1 I I I I I 

TTTAGCCACC AGAGGANTTC TCTTGAAATA CCCAAAATCC ATCAGTATCT TGAATCATGC 60 

TGGATTTTGA AGAATTCTTA AGAAGCCATG TAAAGGGGGC TCTCTGGCCT TGAAATAGTG 120 

ATGTTTTTTA TACAGAAAGG AGAATGCAGA ATGGTCAGAC TATCATGCAC TGTTAAATTT 180 

GATTTCAAGA AATTACAGGA AAACTTTCCA AAGTTCCATC TCACAGAANN TTATTTTNCC 240

AAGAATTCCA AGATAAGTTT AGTTTTATGG AAGACTTTTA TGTGGTTTTT ACTCACTCTT 300 

CATCTCAGAC ATCGACAGAT GATTACATCA CTTATAGTTC TAGTAAATTT ATTAATATAA 360 

AACTCAGAGA CATTCCAATA TCCACATTGC TTACACCATT AGGCATAGAT TCAGTGTCAG 420 

CTATGACAAT TGAAAATGAG CTGTTTTGTG ATTTAAAGGT TTAAATTTCT CTAACCAAAC 480

TGCTTGATCC AGATGCAGGA CTGCAAATGT TAATATTTGT TCTGGAAGAA CAATCAAATA 540 

AGACTTAAGA GGAAAGGGAA TGGCCACAAT CCACCTGAAA TTTTTTCTTA AAAAGTGTGC 600 

AGCCTACTAA ATCAGAATGA AAATAGAAGT ACAAGATTAT AAACAAAATG CAATCAAACT 660 

TTTCTTAAGC TTACCTAAAG TTATTTCATC TGAAAATTTC AAGCAACTTT GTTCAACATT 720 

AAATTGACAA TCTAAACTAA CAAGTCTTTT GAATTTATGC ATGGTAGTAA ACATTCTCTC 780 

TATTAACTTT ATTACCTAAG GCTAAACCTA AAATTTTTAA GCAAAATTAG AAAAATAGTC 840 

TTCACTCATC AAAAAATAAA GTTTGTTACA TTTAGTATTT TCCCAATAAA ATTGGTCGTT 900 

CTTGGTTTTT TATTTGGAGA GTCTGTGCAA AATGTCACTA AAAATAAATT AGCACTAGAA 960 
ATTATTTCTA AATACCAAA 



Seq ID NO: 203 DNA Sequence 

Nucleic Acid Accession #: NM_005656.1 

Coding sequence: 57.. 1535 



1 11 21 31 41 51

| | | | | |

GTCATATTGA ACATTCCAGA TACCTATCAT TACTCGATGC TGTTGATAAC AGCAAGATGG 60 

CTTTGAACTC AGGGTCACCA CCAGCTATTG GACCTTACTA TGAAAACCAT GGATACCAAC 120 

CGGAAAACCC CTATCCCGCA CAGCCCACTG TGGTCCCCAC TGTCTACGAG GTGCATCCGG 180

CTCAGTACTA CCCGTCCCCC GTGCCCCAGT ACGCCCCGAG GGTCCTGACG CAGGCTTCCA 240 

ACCCCGTCGT CTGCACGCAG CCCAAATCCC CATCCGGGAC AGTGTGCACC TCAAAGACTA 300 

AGAAAGCACT GTGCATCACC TTGACCCTGG GGACCTTCCT CGTGGGAGCT GCGCTGGCCG 360 

CTGGCCTACT CTGGAAGTTC ATGGGCAGCA AGTGCTCCAA CTCTGGGATA GAGTGCGACT 420 

CCTCAGGTAC CTGCATCAAC CCCTCTAACT GGTGTGATGG CGTGTCACAC TGCCCCGGCG 480

GGGAGGACGA GAATCGGTGT GTTCGCCTCT ACGGACCAAA CTTCATCCTT CAGATGTACT 540 

CATCTCAGAG GAAGTCCTGG CACCCTGTGT GCCAAGACGA CTGGAACGAG AACTACGGGC 600 

GGGCGGCCTG CAGGGACATG GGCTATAAGA ATAATTTTTA CTCTAGCCAA GGAATAGTGG 660 

ATGACAGCGG ATCCACCAGC TTTATGAAAC TGAACACAAG TGCCGGCAAT GTCGATATCT 720 

ATAAAAAACT GTACCACAGT GATGCCTGTT CTTCAAAAGC AGTGGTTTCT TTACGCTGTT 780

TAGCCTGCGG GGTCAACTTG AACTCAAGCC GCCAGAGCAG GATCGTGGGC GGTGAGAGCG 840 

CCCTCCCGGG GGCCTGGCCC TGGCAGGTCA GCCTGCACGT CCAGAACGTC CACGTGTGCG 900 

GAGGCTCCAT CATCACCCCC GAGTGGATCG TGACAGCCGC CCACTGCGTG GAAAAACCTC 960 

TTAACAATCC ATGGCATTGG ACGGCATTTG CGGGGATTTT GAGACAATCT TTCATGTTCT 1020 

ATGGAGCCGG ATACCAAGTA CAAAAAGTGA TTTCTCATCC AAATTATGAC TCCAAGACCA 1080 

AGAACAATGA CATTGCGCTG ATGAAGCTGC AGAAGCCTCT GACTTTCAAC GACCTAGTGA 1140 

AACCAGTGTG TCTGCCCAAC CCAGGCATGA TGCTGCAGCC AGAACAGCTC TGCTGGATTT 1200

CCGGGTGGGG GGCCACCGAG GAGAAAGGGA AGACCTCAGA AGTGCTGAAC GCTGCCAAGG 1260 

TGCTTCTCAT TGAGACACAG AGATGCAACA GCAGATATGT CTATGACAAC CTGATCACAC 1320 

CAGCCATGAT CTGTGCCGGC TTCCTGCAGG GGAACGTCGA TTCTTGCCAG GGTGACAGTG 1380 

GAGGGCCTCT GGTCACTTCG AACAACAATA TCTGGTGGCT GATAGGGGAT ACAAGCTGGG 1440

GTTCTGGCTG TGCCAAAGCT TACAGACCAG GAGTGTACGG GAATGTGATG GTATTCACGG 1500 

ACTGGATTTA TCGACAAATG AAGGCAAACG GCTAATCCAC ATGGTCTTCG TCCTTGACGT 1560 

CGTTTTACAA GAAAACAATG GGGCTGGTTT TGCTTCCCCG TGCATGATTT ACTCTTAGAG 1620 

ATGATTCAGA GGTCACTTCA TTTTTATTAA ACAGTGAACT TGTCTGGCTT TGGCACTCTC 1680 

TGCCATACTG TGCAGGCTGC AGTGGCTCCC CTGCCCAGCC TGCTCTCCCT AACCCCTTGT 1740 

CCGCAAGGGG TGATGGCCGG CTGGTTGTGG GCACTGGCGG TCAATTGTGG AAGGAAGAGG 1800 

GTTGGAGGCT GCCCCCATTG AGATCTTCCT GCTGAGTCCT TTCCAGGGGC CAATTTTGGA 1860 

TGAGCATGGA GCTGTCACTT CTCAGCTGCT GGATGACTTG AGATGAAAAA GGAGAGACAT 1920 

GGAAAGGGAG ACAGCCAGGT GGCACCTGCA GCGGCTGCCC TCTGGGGCCA CTTGGTAGTG 1980 

TCCCCAGCCT ACTTCACAAG GGGATTTTGC TGATGGGTTC TTAGAGCCTT AGCAGCCCTG 2040 

GATGGTGGCC AGAAATAAAG GGACCAGCCC TTCATGGGTG GTGACGTGGT AGTCACTTGT 2100 

AAGGGGAACA GAAACATTTT TGTTCTTATG GGGTGAGAAT ATAGACAGTG CCCTTGGTGC 2160 

GAGGGAAGCA ATTGAAAAGG AACTTGCCCT GAGCACTCCT GGTGCAGGTC TCCACCTGCA 2220 

CATTGGGTGG GGCTCCTGGG AGGGAGACTC AGCCTTCCTC CTCATCCTCC CTGACCCTGC 2280 

TCCTAGCACC CTGGAGAGTG AATGCCCCTT GGTCCCTGGC AGGGCGCCAA GTTTGGCACC 2340 

ATGTCGGCCT CTTCAGGCCT GATAGTCATT GGAAATTGAG GTCCATGGGG GAAATCAAGG 2400 

ATGCTCAGTT TAAGGTACAC TGTTTCCATG TTATGTTTCT ACACATTGAT GGTGGTGACC 2460 
CTGAGTTCAA AGCCATCTT 



Seq ID NO: 204 Protein Sequence
Protein Accession #: NP_005647.1

1 11 21 31 41 51

| | | | | |

MALNSGSPPA IGPYYENHGY QPENPYPAQP TWPTVYEVH PAQYYPSPVP QYAPRVLTQA 60
SNPWCTQPK SPSGTVCTSK TKKALCITLT LGTFLVGAAL AAGLLWKFMG SKCSKSGIEC 120
DSSGTCINPS NWCDGVSHCP GGEDENRCVR LYGPNFILQM YSSQRKSWHP VCQDDWNENY 180
GRAACRDMGY KNNFYSSQGI VDDSGSTSFM KLNTSAGNVD IYKKLYHSDA CSSKAWSLR 240
CLACGVNLNS SRQSRIVGGE SALPGAWPWQ VSLHVQNVHV CGGSIITPEW IVTAAHCVEK 300
PLNNPWHWTA FAGILRQSFM FYGAGYQVQK VISHPNYDSK TKNNDIALMK LQKPLTFNDL 360
VKPVCLPNPG MMLQPEQLCW ISGWGATEEK GKTSEVLNAA KVLLIETQRC NSRYVYDNLI 420
TPAMICAGFL QGNVDSCQGD SGGPLVTSNN NIWWLIGDTS WGSGCAKAYR PGVYGNVMVF 480
TDWIYRQMKA NG



Seq ID NO: 205 DNA Sequence
Nucleic Acid Accession #: XM_044533
Coding sequence: 238..2751

1 11 21 31 41 51

| | | | | |

GCTCTGCCCA AGCCGAGGCT GCGGGGCCGG CGCCGGCGGG AGGACTGCGG TGCCCCGCGG 60

AGGGGCTGAG TTTGCCAGGG CCCACTTGAC CCTGTTTCCC ACCTCCCGCC CCCCAGGTCC 120

GGAGGCGGGG GCCCCCGGGG CGACTCGGGG GCGGACCGCG GGGCGGAGCT GCCGCCCGTG 180

AGTCCGGCCG AGCCACCTGA GCCCGAGCCG CGGGACACCG TCGCTCCTGC TCTCCGAATG 240

CTGCGCACCG CGATGGGCCT GAGGAGCTGG CTCGCCGCCC CATGGGGCGC GCTGCCGCCT 300

CGGCCACCGC TGCTGCTGCT CCTGCTGCTG CTGCTCCTGC TGCAGCCGCC GCCTCCGACC 360

TGGGCGCTCA GCCCCCGGAT CAGCCTGCCT CTGGGCTCTG AAGAGCGGCC ATTCCTCAGA 420

TTCGAAGCTG AACACATCTC CAACTACACA GCCCTTCTGC TGAGCAGGGA TGGCAGGACC 480

CTGTACGTGG GTGCTCGAGA GGCCCTCTTT GCACTCAGTA GCAACCTCAG CTTCCTGCCA 540

GGCGGGGAGT ACCAGGAGCT GCTTTGGGGT GCAGACGCAG AGAAGAAACA GCAGTGCAGC 600

TTCAAGGGCA AGGACCCACA GCGCGACTGT CAAAACTACA TCAAGATCCT CCTGCCGCTC 660

AGCGGCAGTC ACCTGTTCAC CTGTGGCACA GCAGCCTTCA GCCCCATGTG TACCTACATC 720

AACATGGAGA ACTTCACCCT GGCAAGGGAC GAGAAGGGGA ATGTCCTCCT GGAAGATGGC 780

AAGGGCCGTT GTCCCTTCGA CCCGAATTTC AAGTCCACTG CCCTGGTGGT TGATGGCGAG 840

CTCTACACTG GAACAGTCAG CAGCTTCCAA GGGAATGACC CGGCCATCTC GCGGAGCCAA 900

AGCCTTCGCC CCACCAAGAC CGAGAGCTCC CTCAACTGGC TGCAAGACCC AGCTTTTGTG 960

GCCTCAGCCT ACATTCCTGA GAGCCTGGGC AGCTTGCAAG GCGATGATGA CAAGATCTAC 1020

TTTTTCTTCA GCGAGACTGG CCAGGAATTT GAGTTCTTTG AGAACACCAT TGTGTCCCGC 1080

ATTGCCCGCA TCTGCAAGGG CGATGAGGGT GGAGAGCGGG TGCTACAGCA GCGCTGGACC 1140

TCCTTCCTCA AGGCCCAGCT GCTGTGCTCA CGGCCCGACG ATGGCTTCCC CTTCAACGTG 1200

CTGCAGGATG TCTTCACGCT GAGCCCCAGC CCCCAGGACT GGCGTGACAC CCTTTTCTAT 1260

GGGGTCTTCA CTTCCCAGTG GCACAGGGGA ACTACAGAAG GCTCTGCCGT CTGTGTCTTC 1320

ACAATGAAGG ATGTGCAGAG AGTCTTCAGC GGCCTCTACA AGGAGGTGAA CCGTGAGACA 1380

CAGCAGTGGT ACACCGTGAC CCACCCGGTG CCCACACCCC GGCCTGGAGC GTGCATCACC 1440

AACAGTGCCC GGGAAAGGAA GATCAACTCA TCCCTGCAGC TCCCAGACCG CGTGCTGAAC 1500

TTCCTCAAGG ACCACTTCCT GATGGACGGG CAGGTCCGAA GCCGCATGCT GCTGCTGCAG 1560

CCCCAGGCTC GCTACCAGCG CGTGGCTGTA CACCGCGTCC CTGGCCTGCA CCACACCTAC 1620

GATGTCCTCT TCCTGGGCAC TGGTGACGGC CGGCTCCACA AGGCAGTGAG CGTGGGCCCC 1680

CGGGTGCACA TCATTGAGGA GCTGCAGATC TTCTCATCGG GACAGCCCGT GCAGAATCTG 1740

CTCCTGGACA CCCACAGGGG GCTGCTGTAT GCGGCCTCAC ACTCGGGCGT AGTCCAGGTG 1800

CCCATGGCCA ACTGCAGCCT GTACAGGAGC TGTGGGGACT GCCTCCTCGC CCGGGACCCC 1860

TACTGTGCTT GGAGCGGCTC CAGCTGCAAG CACGTCAGCC TCTACCAGCC TCAGCTGGCC 1920

ACCAGGCCGT GGATCCAGGA CATCGAGGGA GCCAGCGCCA AGGACCTTTG CAGCGCGTCT 1980

TCGGTTGTGT CCCCGTCTTT TGTACCAACA GGGGAGAAGC CATGTGAGCA AGTCCAGTTC 2040

CAGCCCAACA CAGTGAACAC TTTGGCCTGC CCGCTCCTCT CCAACCTGGC GACCCGACTC 2100

TGGCTACGCA ACGGGGCCCC CGTCAATGCC TCGGCCTCCT GCCACGTGCT ACCCACTGGG 2160

GACCTGCTGC TGGTGGGCAC CCAACAGCTG GGGGAGTTCC AGTGCTGGTC ACTAGAGGAG 2220

GGCTTCCAGC AGCTGGTAGC CAGCTACTGC CCAGAGGTGG TGGAGGACGG GGTGGCAGAC 2280

CAAACAGATG AGGGTGGCAG TGTACCCGTC ATTATCAGCA CATCGCGTGT GAGTGCACCA 2340

GCTGGTGGCA AGGCCAGCTG GGGTGCAGAC AGGTCCTACT GGAAGGAGTT CCTGGTGATG 2400

TGCACGCTCT TTGTGCTGGC CGTGCTGCTC CCAGTTTTAT TCTTGCTCTA CCGGCACCGG 2460

AACAGCATGA AAGTCTTCCT GAAGCAGGGG GAATGTGCCA GCGTGCACCC CAAGACCTGC 2520

CCTGTGGTGC TGCCCCCTGA GACCCGCCCA CTCAACGGCC TAGGGCCCCC TAGCACCCCG 2580

CTCGATCACC GAGGGTACCA GTCCCTGTCA GACAGCCCCC CGGGGTCCCG AGTCTTCACT 2640

GAGTCAGAGA AGAGGCCACT CAGCATCCAA GACAGCTTCG TGGAGGTATC CCCAGTGTGC 2700

CCCCGGCCCC GGGTCCGCCT TGGCTCGGAG ATCCGTGACT CTGTGGTGTG AGAGCTGACT 2760

TCCAGAGGAC GCTGCCCTGG CTTCAGGGGC TGTGAATGCT CGGAGAGGGT CAACTGGACC 2820

TCCCCTCCGC TCTGCTCTTC GTGGAACACG ACCGTGGTGC CCGGCCCTTG GGAGCCTTGG 2880

GGCCAGCTGG CCTGCTGCTC TCCAGTCAAG TAGCGAAGCT CCTACCACCC AGACACCCAA 2940

ACAGCCGTGG CCCCAGAGGT CCTGGCCAAA TATGGGGGCC TGCCTAGGTT GGTGGAACAG 3000

TGCTCCTTAT GTAAACTGAG CCCTTTGTTT AAAAAACAAT TCCAAATGTG AAACTAGAAT 3060

GAGAGGGAAG AGATAGCATG GCATGCAGCA CACACGGCTG CTCCAGTTCA TGGCCTCCCA 3120

GGGGTGCTGG GGATGCATCC AAAGTGGTTG TCTGAGACAG AGTTGGAAAC OCTCACCAAC 3180

TGGCCTCTTC ACCTTCCACA TTATCCCGCT GCCACCGGCT GCCCTGTCTC ACTGCAGATT 3240

CAGGACCAGC TTGGGCTGCG TGCGTTCTGC CTTGCCAGTC AGCOGAGGAT GTAGTTGTTG 3300

CTGCCGTCGT CCCACCACCT CAGGGACCAG AGGGCTAGGT TGGCACTGCG GCCCTCACCA 3360

GGTCCTGGGC TCGGACCCAA CTCCTGGACC TTTCCAGCCT GTATCAGGCT GTGGCCACAC 3420

GAGAGGACAG CGCGAGCTCA GGAGAGATTT CGTGACAATG TACGCCTTTC CCTCAGAATT 3480

CAGGGAAGAG ACTGTCGCCT GCCTTCCTCC GTTGTTGCGT GAGAACCCG7 GTGCCCCTTC 3540

CCACCATATC CACCCTCGCT CCATCTTTGA ACTCAAACAC GAGGAACTAA CTGCACCCTG 3600

GTCCTCTCCC CAGTCCCCAG TTCACCCTCC ATCCCTCACC TTCCTCCACT CTAAGGGATA 3660 

TCAACACTGC CCAGCACAGG GGCCCTGAAT TTATGTGGTT TTTATACATT TTTTAATAAG 3720 
ATGCACTTTA TGTCATTTTT TAATAAAGTC TGAAGAATTA CTGTTT 

Seq ID NO: 206 Protein Sequence 
Protein Accession #: XP_044533.6 

1 11 21 31 41 51 

i I I I I I 

MLRTAMGLRS WLAAPWGALP PRPPLLLLLL LLLLLQPPPP TWALSPRISL PLGSEERPFL 60 

RFEAEHISNY TALLLSRDGR TLYVGAREAL FALSSNLSFL PGGEYQELLW GADAEKKQQC 120 

SFKGKDPQRD CQNYIKILLP LSGSHLFTCG TAAFSPMCTY INMENFTLAR DEKGNVLLED 180 

GKGRCPFDPN FKSTALWDG ELYTGTVSSF QGNDPAISRS QSLRPTKTES SLNWLQDPAF 240 

VASAYIPESL GSLQGDDDKI YFFFSETGQE FEFFENTIVS RIARICKGDE GGERVLQQRW 300 

TSFLKAQLLC SRPDDGFPFN VLQDVFTLSP SPQDWRDTLF YGVFTSQWHR GTTEGSAVCV 360 

FTMKDVQRVF SGLYKEVNRE TQQWYTVTHP VPTPRPGACI TNSARERKIN SSLQLPDRVL 420 

NFLKDHFLMD GQVRSRMLLL QPQARYQRVA VHRVPGLHHT YDVLFLGTGD GRLHKAVSVG 480 

PRVHIIEELQ IFSSGQPVQN LLLDTHRGLL YAASHSGWQ VPMANCSLYR SCGDCLLARD 540

PYCAWSGSSC KHVSLYQPQL ATRPWIQDIE GASAKDLCSA SSWSPSFVP TGEKPCEQVQ 600 

FQPNTVNTLA CPLLSNLATR LWLRNGAPVN ASASCHVLPT GDLLLVGTQQ LGEFQCWSLE 660 

EGFQQLVASY CPEWEDGVA DQTDEGGSVP VIISTSRVSA PAGGKASWGA DRSYWKEFLV 720 

MCTLFVLAVL LPVLFLLYRH RNSMKVFLKQ GECASVHPKT CPWLPPETR PLNGLGPPST 780 
PLDHRGYQSL SDSPPGSRVF TESEKRPLSI QDSFVEVSPV CPRPRVRLGS EIRDSW 
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It is understood that the examples described above in no way serve to limit the true 
scope of this invention, but rather are presented for illustrative purposes. All publications, 
sequences of accession numbers, and patent applications cited in this specification are herein 
incorporated by reference as if each individual publication or patent application were 
specifically and individually indicated to be incorporated by reference. 









WHAT IS CLAIMED IS: 

1. A method of detecting a bladder cancer-associated transcript in a cell from a patient, 
the method comprising contacting a biological sample from the patient with a polynucleotide 
that selectively hybridizes to a sequence at least 80% identical to a sequence as shown in 
Tables 1A-13. 

2. The method of Claim 1, wherein the biological sample comprises isolated nucleic 
acids. 

3. The method of Claim 2: 

a) wherein the nucleic acids are mRNA; or 

b) further comprising the step of amplifying nucleic acids before the step of contacting 
the biological sample with the polynucleotide. 



4. The method of Claim 1, wherein the polynucleotide: 

a) comprises a sequence as shown in Tables 1A-13; or 

b) is immobilized on a solid surface. 

5. The method of Claim 1, wherein the patient is: 

a) undergoing a therapeutic regimen to treat bladder cancer; or 

b) suspected of having bladder cancer. 

6. An isolated nucleic acid molecule consisting of a polynucleotide sequence as shown 
in Tables 1A-13. 



7. The nucleic acid molecule of Claim 6, which is labeled. 
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8. An expression vector comprising the nucleic acid of Claim 7. 
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9. A host cell comprising the expression vector of Claim 8. 

10. An isolated polypeptide which is encoded by a nucleic acid molecule having a 
polynucleotide sequence as shown in Tables 1A-13. 

11. An antibody that specifically binds a polypeptide of Claim 10. 

12. The antibody of Claim 11, further conjugated to an effector component. 

13. The antibody of Claim 12, wherein the effector component is a fluorescent label. 

14. The antibody of Claim 12, wherein the effector component is a radioisotope or a 
cytotoxic chemical. 

15. The antibody of Claim 11, which is: 

a) an antibody fragment; or 

b) a humanized antibody. 

16. A method of detecting a bladder cancer cell in a biological sample from a patient, the 
method comprising contacting the biological sample with an antibody of Claim 11. 

17. The method of Claim 16, wherein the antibody is further conjugated to an effector 
component. 

18. The method of Claim 17, wherein the effector component is a fluorescent label. 

19. A method for identifying a compound that modulates a bladder cancer-associated 
polypeptide, the method comprising the steps of: 
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a) contacting the compound with a bladder cancer-associated polypeptide, the 
polypeptide encoded by a polynucleotide that selectively hybridizes to a sequence at 
least 80% identical to a sequence as shown in Tables 1A-13; and 

b) determining the functional effect of the compound upon the polypeptide. 

20. A drug screening assay comprising the steps of: 

a) administering a test compound to a mammal having bladder cancer or a cell isolated 
therefrom; 

b) comparing the level of gene expression of a polynucleotide that selectively hybridizes 
to a sequence at least 80% identical to a sequence as shown in Tables 1A-13 in a 
treated cell or mammal with the level of gene expression of the polynucleotide in a 
control cell or mammal, wherein a test compound that modulates the level of 
expression of the polynucleotide is a candidate for the treatment of bladder cancer. 
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